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Abstract 


This thesis is concerned with nonlinear control of an Inertia Wheel Pendulum using passivity 
approach. The Inertia Wheel Pendulum (IWP) model is one of the simplest prototypes for 
underactuated systems to test variegated nonlinear control strategies. 

IWP system dynamics has two equilibrium points of which the top unstable equilibrium 
point offers challenge to a control engineer. The control problem is often defined as swing¬ 
ing up the inertial wheel from downward bottom stable equilibrium position towards the top 
unstable equilibrium point where the wheel must stay stable forever. Implementation of a 
simple state feedback controller shows that the classical linearized controller can stabilize the 
pendulum at the top unstable equilibrium point if deflection angle \ Xgi \< 0.12radian from 
the equilibrium position. Thus the problem is redefined if there exists a linearized control 
that can stabilize the pendulum when the deflection angle | |< ‘^radian. It turns out that 

the control startegy using input-state feedback linearisation can stabilize the pendulum if the 
initial position of the pendulum \ Xgi \< ^radian. However, the original control problem is 
still not solved since the pendulum has to be brought from the downward bottom equilibrium 
position to the range defined by | |< ^radian. Thus came the idea of swing up control as 

proposed by Spong [1]. 

A passivity based controller is designed where the IWP system is considered of consisting 
of two subsystems, namely, pendulum subsystem and wheel subsystem. The output variable 
is identified for each subsystem so that each subsystem reamins passive from input to out¬ 
put. A control law is then proposed so that the combined system of pendulum and wheel is 
also passive. Further analysis shows that this proposed control law can only swing up the 
pendulum from the downward bottom stable equilibrium position to top unstable equilibrium 
position. A compensation term is added to take into account of friction during swing up which 
increases the amplitudes of swing up oscillations. However, the control law fails to stabilize 
the pendulum at the top unequilibrium position due to the fact that the controller brings the 
trajectory into a homoclinic orbit which intersects both stable and unstable equilibrium points. 
Thus the balancing controller is introduced to stabilize the linearized system at the top. 

Another passivity based controller named Interconnection and Damping Assignment pas¬ 
sivity based control is revisited. This method represents system dynamics in Hamiltonian 
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formulations instead of usual Euler Lagrangian (EL) formulation.. In Hamiltonian model, the 
system dynamics are parametrized by two matrices, called interconnection matrix and damp¬ 
ing matrix. The controller is designed by chosing suitable closed loop interconnection and 
damping matrices for a system. The approach involves estimation of an extra energy function 
term Ha that must be added to open-loop energy function which makes the system passive 
in closed loop. The controller is parametrized in terms of partial differential terms involving 
estimated Ha- The stability of this controller algorithm has been proved using La Salle’s in¬ 
variance principle. Although this control algorithm can take the pendulum from its downward 
bottom equilibrium position to upward top equilibrium position at one go in simulation, the 
real time experiment can not be done using this algorithm. This is because the algorithm de¬ 
mands very high transient torque and speed demand for the inertial wheel which the available 
mechatronics control kit in the laboratory can not meet with. In essence, many possible theo¬ 
retical algorithms can not be implemented in real time because of limitations in the hardware 
set up. 

It could be ascertained during this work that dual mode controllers are best suitable for 
real-time implementation. This work ends with a novel proposal of swing up control using 
pumping of energy concept. Earlier, energy based swing up controllers have been proposed 
which could work in simulation, but failed during experimentation due to presence of friction. 
However, the proposed energy based swing up controller in this work works both in simulation 
and experimentation. 


IV 



Contents 


1 Introduction 1 

1.1 Introduction. 1 

1.2 Feedback Linearisation. 2 

1.2.1 Input-Output Linearization. 2 

1.2.2 Input-State Linearization. 3 

1.3 Sliding Mode Control. 3 

1.4 Backstepping. 5 

1.5 Passivity Based Control. 5 

1.6 Underactuated systems. 7 

1.7 Present Work. 8 

2 Inertia Wheel Pendulum (IWP) 10 

2.1 Introduction. 10 

2.2 The Model. 10 

2.3 The Experimental Setup ..'. 13 

2.3.1 The Mechatronics Kit Hardware. 13 

2.3.2 The Software. 14 

2.3.3 System Parameters. 14 

2.3.4 Some Considerations during Experimentation. 15 

2.4 Linearisation . 16 

2.4.1 Analysis of Linearized Model. 16 

2.4.2 The State Feedback Control Law for Linearized System. 17 


V 






















2.4.3 Simulation and Experimental Results. 19 

2.5 Concluding Remarks. 22 

3 A hybrid Control of IWP 23 

3.1 Introduction. 23 

3.2 Feedback Linearisation. 23 

3.2.1 Mathematical Preliminaries. 24 

3.2.2 feedback linearisation or Input-State Linearisation. 27 

3.2.3 Feedback linearisation of IWP. 30 

3.2.4 Control Design. 32 

3.2.5 Simulation and Experimental Results for Feedback Linearization ... 33 

3.3 Partial Feedback linearization .. 36 

3.3.1 Analysis . 36 

3.3.2 Control Design. 37 

3.3.3 Simulation and Experimental Results of Complete Stabilization ... 37 

3.4 Concluding Remarks. 40 

4 Passivity Based Control of IWP 42 

4.1 Introduction. 42 

4.2 Dissipativity and passivity. 42 

4.2.1 Passive systems and their properties. 44 

4.3 Passivity Based Control (PBC) of IWP. 47 

4.3.1 Effect of friction on Controller. 50 

4.4 Simulation and Experimental Results of Passivity Based Swing up. 51 

4.5 Experimental Results for Complete stabilisation. 55 

4.6 Concluding Remarks. 55 

5 IDA-PBC Control of IWP 57 

5.1 Introduction. 57 

5.2 Port Controlled Hamiltonian systems. 58 

5.2.1 Port-controlled Hamiltonian model from EL and Hamiltonian model. 58 


VI 

























5.2.2 Basic properties of Port-controlled Hamiltonian systems. 61 

5.3 Interconnection and Damping Assignment PBC. 62 

5.4 The Control Problem in the Context of Passivity. 64 

5.5 IDA-PB Controller design . 65 

5.5.1 Application to mechanical systems. 68 

5.5.2 Inertia Wheel Pendulum System. 73 

5.5.3 Simulation Results for IDA-PBC on IWP. 81 

5.5.4 Simulation and Experimental results of Balancing IWP by IDA-PBC 84 

5.5.5 Swing up Controller. 88 

5.5.6 Simulation and Experimental results for Complete stabilisation .... 91 

5.5.7 Concluding Remarks. 91 

6 Conclusion 96 

6.1 Summary. 96 

6.2 Scope of Future Work. 97 


vii 














List of Figures 


2.1 The Inertia Wheel Pendulum. 11 

2.2 Simulation and Experimental Results For Linearised Control (Xqx ~ Xqx ~ 

t, Xq2^t) . 20 

2.3 Simulation and Experimental Results For Linearised Control (Xq 2 ^ t, u 

^ql ^ ^ql) .. 21 

3.1 Simulation and Experimental Results of Feedback Linearization (xqi t, Xqi ~ 

t, Xq2^t) . 34 

3.2 Simulation and Experimental Results of Feedback Linearization (Xq 2 

t, Xqi ~ 35 

3.3 Simulation and Experimental Results of Complete Stabilisation {xqi ~ t, Xqi ~ 

t, Xq2^t) . 38 

3.4 Simulation and Experimental Results of Complete Stabilisation {xq 2 U'^ 

t, Xqi^ Xqi) . 39 

4.1 Output feedback passive system. 45 

4.2 Parallel interconnection systems. 45 

4.3 Feedback interconnection systems. 46 

4.4 IWP system as a parallel interconnection. 47 

4.5 Simulation and Experimental Results of Passivity based swing up (x^i ~ 

t, Xqi t, Xq2 t) . 52 

4.6 Simulation and Experimental results of Passivity Based swing up (Xq 2 ~ 

t, u^t, E^t) . 53 
















4.7 Experimental results of Passivity Based swing up with LSF balancing control(a;,i ~ 
i, igl ~ i, Xq 2 ~ t, Xq 2 t, U t, E ^ t) . 56 

5.1 Simulation results of IDA-PBC (Xqi ~ t, Xgj ~ t, Xq2 ~ t, Xq2 t, u 

t, Hd^t) . 83 

5.2 Simulation results of IDA-PBC (Hp t, Xqi ^ Xgi . 84 

5.3 Simulation and Experimental Results of Balancing IWP by IDA-PBC (a:,! ~ 

i, ^qi ~ 0. 86 

5.4 Simulation and Experimental Results of balancing IWP by IDA-PBC (Xg 2 ~ 

t, Xq2 ^ t, u t) . 87 

5.5 Simulation and Experimental Results of balancing IWP by IDA-PBC (Hd ~ 

t, Hp ~ t, Xqi ~ a:<ji) . 88 

5.6 Simulation and Experimental results for Complete Stabilization (Xgi ~ t, Xgi ~ 

t, Xq 2 ^t) . 93 

5.7 Simulation and Experimental results for Complete Stabilisation {Xq2 ^ t, u ^ 

t, Hpr^t) . 94 

5.8 Simulation and Experimental results for Complete Stabilization {Hd ~ t, Xqi ~ 

Xql) . 95 












Chapter 1 
Introduction 


1.1 Introduction 

Natural systems are inherently nonlinear. Studies in nonlinear systems have thus fascinated 
researchers from many fields. As one builds artificial systems mimicking natural systems, not 
only one faces the challenges of dealing with nonlinear dynamics, but also with the problem 
of dealing with numerous actuators (biological motor controls) that natural systems do have. 
Thus comes the motivation to design underactuated systems although such systems are even 
more difficult to control as compared to fully actuated nonlinear systems. One often comes 
across many underactuated systems examples such as underwater vehicles, helicopters, air¬ 
crafts, hovercrafts, satellites and walking robots. Although there exists variegated control 
algorithms for fully actuated nonlinear systems these algorithms are not directly applicable 
to underactuated systems. More over, a general framework has not evolved to describe all 
possible underactuated systems. Hence the control problem for an underactuated system that 
can be dealt with has to be system specific since it can not be generic at the present moment. 
One should be aware of various nonlinear control strategies already available in the literature 
before designing controllers for underactuated systems. Some of the nonlinear control tech¬ 
niques that are relevant to the design of controllers for underactuated systems are feedback 
linearization, sliding mode control, backstopping and passivity based controllers. 
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1.2 Feedback Linearisation 


The idea of feedback linearization is to transform a nonlinear system dynamics into a linear 
one using (dynamic) feedback. The advantage is evident that once the system is transformed 
into linear one, all linear control techniques can be applied. The theory of linearization is 
developed from Differential Geometry (details in [2, 3] .There are basically two kinds of ap¬ 
proaches to linearization, namely, 

1. Input-Output linearization 

2. Input-State linearization 

1.2.1 Input-Output Linearization 

Consider a single input single output (SISO) nonlinear system 

X = f{x)+g{x)u 

y=^h{x) (1.1) 

In the above equation (1.1) output is related to input through state. The objective is to bring 
out direct linear relationship between input and output, 

= V ( 1 . 2 ) 

where v = 'y{x, u),'m £ R indicates differentiation by m times and m <n and m is known 
as relative degree of the system (1.1). 

If in the case m <n, then the transformed system (1.2) will have unobservable dynamics 
of order n — m. The unobservable dynamics, named as zero dynamics by Isidori et al. in [3], 
are unstable then this technique fails. 

MIMO systems are also dealt in the same way with bit more complexity (see [4]). This 
technique has been successfully implemented in several practical applications. 
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1.2.2 Input-State Linearization 

In this approach, the objective of linearization is achieved in two steps. State transformation 
and Input transformation. Moreover in this approach we are not concerned with output. The 
formal definition of the problem (in SISO case) is given in [2]. Consider the system (1.1a), 

X = f(x) + g{x)u 

the objective is to findout a state and input transformations, 

V = q{x) + s{x)u 

z = T{x) (1.3) 

so that the original system is transformed into linear one, 

z = Az + Bv (1.4) 

Infact in the equation (1.2), relative degree, m = n, order of the .system (1.1) then Input- 
Output linearization leads to Input-State Linearization with state trans- formed from x y 

Control design using feedback linearization involves partial differential equations and 
solving these equations is really difficult particularly in the case of MIMO systems. Still 
the topic is under study and the research is underway to overcome the problem of solving 
partial differential equations. 

1.3 Sliding Mode Control 

The underlying principle behind Sliding Mode Control (SMC) is to alter the systerri dynamics 
via the application of a high speed switching control, hence it is also known as Variable 
Structure Control. Infact this approach is also a state feedback scheme but feedback gains are 
discontinuous functions of time. 

This technique has drawn interest among control research community since the publica¬ 
tion of the survey paper [5] by Vadim I. Utkin in 1977. This technique is pretty old but still 
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in application because of its advantages, simplicity and inherent ability to take care of uncer¬ 
tainties present in the model parameters once we know the bounds on uncertainty. 

On the negative side it suffers from the so-called chattering phenomenon. Chattering phe¬ 
nomenon is generally perceived as motion which oscillates about the sliding manifold. Many 
analytical design methods were proposed to reduce the effects of chattering [6]. Secondly, the 
exact performance of zero error can not be achieved as it needs infinitely fast switching. So 
trade-off between the error tolerance and switching frequency is to be made. Here we give a 
brief idea about control design. Consider again the system (1.1). 

* = f{^) + 9ix)u 

The basic methodology is to choose a stable one dimensional manifold in the whole state space 
and choose the control input in such a way that the state evolves along the stable manifold 
called sliding surface. For the above system sliding surface can chosen to be 


S — fln^n On—l^n— 1 + * '' 4 - diXi = 0 (1-5) 

where tracking error,Xj = Xj — x^ and x’^ represents the desired trajectory. Note that the 
unique solution of first order differential equation is 5 = 0. Hence the problem is simplified 
as one has to select a suitable control input u so as to keep s = 0 to achieve the desired 
tracking performance. Let us select a control action such that 

55*' < -ilsl (1-6) 

where rj is strictly positive constant. 

The above equation tells that if the system state is going away from the manifold s = 0, then 
the control action pushes the states towards the same manifold s = 0. 

The sliding mode technique has been successfully applied to various non-linear systems 
like robot manipulators, control of automotives, high-performance electric motors. 
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1.4 Backstepping 

Backstopping is another alternative technique to design stabilizing controllers for nonlinear 
systems. This technique is motivated by the fact that first order systems are easy to deal with 
and can be used as starting points of recursive design for higher order systems. 

A quantitative treatment of this technique is not provided here since the backstepping 
concept has been elaborated in chapter 2. For detailed treatment of backstepping technique 
one can refer the book by ICrstic, Kanellakopoulos and Kokotovic [7]. The approach primarily 
makes use of Lyapunov stability theory and recurisvely designs controller starting from first 
order to higher order dynamics. 

Backstepping finds great scope in the field of adaptive and robust control [8]. Neural 
network based robust backstepping techniques [9] have been proposed which eliminates the 
tedious calculation of regression matrices. In addition such neural control schemes do not 
demand that nonlinear systems should be linearly parametrizable - the dynamics is linear in 
terms of unknown parameters - for designing adaptive control schemes. 


1.5 Passivity Based Control 

Passivity is a fundamental property of many physical systems which may be roughly defined 
in terms of energy dissipation and transformation [10]. It is an inherent Input-Output property 
in the sense that it quantifies and qualifies the energy balance of a system when stimulated 
by external inputs to generate some output. Passivity is therefore related to the property of 
stability in an input-output sense, that is the system is stable if bounded input energy supplied 
tO' the system yields bounded output energy. Passivity based control is a methodology which 
consists in controlling a system with the aim at making the closed loop system passive. 

The development of passivity and the connections with feedback stabilization of a system, 
from general operator theoritic point of view, were introduced in the work of Willems [11]. 
A different research line was initiated by work of Wu and Desoer [12]. The development 
of passivity concepts, in relation to stability, was undertaken in the work of vidyasagar [13] 
nnd 2]ames [14] .The extension of Willem’s results to the case of nonlinear systems, which 
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are affine in the control input, was given by Hill and Moylan in [15]. The idea of making a 
system passive by means of a state static feedback and all the related geometry was given in 
the work of Byrnes etal [16}. These results were later complemented, in connection with the 
concept of feedback positive systems, in the work of Kokotovic and Sussman [17]. Definite 
connections between passivity, Lyapunov-based control and inverse optimal feedback can be 
found out in the book by Sepulchre er a/ [18]. The conditions under which a nonlinear system 
is rendered passive, via state static state feedback were extended by Santosuosso [19] to the 
case of affine feedthrough system. The study of feedback dissipativity, for linear case, has 
been addressed in the work of Picci and Pinzoni (see [20]). The corresponding developments 
for the passivity of discrete time nonlinear systems were given by Lin and Lin and Byrnes in 
[21]. For a global perspective area these books can be refered, by Selpulchre et al [18], Ortega 
et al [22] and Khalil [23]. The passive system is a system that can not store more energy than 
the energy that is supplied to it. In other words passive system does not contain any energy 
source. Consider the generalised non-linear system 

x{t) = f{x{t)) + g{x{t))u{t) 

y{t) = h{x{t)) + j{x{t))u{t) (1.7) 

where x G R", u,y £ R'”, and the nonlinear functions, defined by are 

smooth. Let us call S(u{t), y{t)) the supply rate, such that Vu, Var(O) 

/q 1.9(u(i),2/(<))|cif<oo iGR+ 

Definition 1.1 (Dissipativity) A system of the form 1.7 is said to be dissipative with respect 
to the supply rate S{u,y) if and only if there exists a storage function H{x{t)) defined as 
H {x{t)) = K + P, (where K is the kinetic energy and P is the potential energy of the system) 
such that 

H{x{t))<H{x{0))+ [ S{u{t),y{t))dt (1.8) 

Jo 

Vw and^x{0). 

This is called dissipation inequality, which means the storage energy function H{x{t)) at a 
future time t is not bigger than the sum of the available energy function H(x{0)) at an initial 
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time 0 plus the total energy J^S{u{t), y{t))dt supplied to the system from external source in 
the interval [0, f]. There is no internal creation of energy. 

Passive system represent an important subset of dissipative systems. 

Definition 1.2 (Passivity) An EL system of the form is passive if it is dissipative with supply 
rate S{u, y) = vFy. Because the supplied energy is greater than or equal to the total energy 

H, the passivity property can also be written as 

H{x{T))-H(x{0))< rvF{t)y{t)dt (1.9) 

Jo 

Vu(t), VT > 0 and all H(x{0)) 

I. 6 Underactuated systems 

The formal definition of underactuated system is as follows [24]: 

Definition 1.3 Underactuated systems are the systems of the form 

Q = f{Q,q) + G{q)u (1.10) 

where, q is the state vector of independent generalised coordinates, (also called as vector of 
configuration variables), 

f{') is the vector field representing the dynamics of the systems, 
q is the generalised velocity vector, 

G is the input matrix, and 
u is a vector of generalised force 


The dimension of q is defined as the degrees of freedom of the system in (1.10). In general 
(rather literally) a system is said to be underacmated if the external generalized forces are 
not able to command instantaneous accelerations in all directions in the configuration space, 
i.e. rank(G) < dim{q). Feedback linearization, sliding mode control and backstepping 
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schemes as discussed earlier are not directly applicable to underactuated systems. The La- 
grangian dynamics of these underactuated systems may contain feedforward nonlinearities 
[25], non-minimum phase zero dynamics, and other properties that place these class of sys¬ 
tems in forefront of research in nonlinear control. Here arises the question that why to go for 
UAMS with all the complications involved. The reason is as follows: 

1. In many practical cases actuators are expensive or heavy in weight, for these reasons 
some systems are underactuated by design. 

2. In some cases due to failure of some actuator fully-actuated systems may also become 
underactuated and then appropriate control algorithm can be used to still meet the re¬ 
quirement. 

3. Some physical properties like flexible (in joint) make the system underactuated inher¬ 
ently [26]. 

4. The working medium (water and space) adds one or more degrees of freedom where 
even fully actuated system behaves like an Underactuated System [27]. 

Thus there is a need to develop new control techniques applicable to underactuated nonlinear 
systems. The reasearch is focused towards obtaining generalized control algorithms for un¬ 
deractuated nonlinear mechanical systems. Since generalized objective is difficult to achive 
and also we are interested in stabilizing particular classes of mechanical systems, to simplify 
the task researchers have been studying simple mechanical systems. Some of the systems 
represent academic benchmarks and are pert of a standard control laboratory like the inertia 
wheel pendulum, pendubot, furuta pendulum, the planar manipulator with springs between 
the links, the ball and beam and the PVTOL aircraft. In spite of they are simple mechanical 
plants, they represent a challenge to the non-linear control community. 


1.7 Present Work 

The Inertia Wheel Pendulum (IWP) is one of the simplest prototypes to study underactu¬ 
ated systems. Usually researchers have adopted dual mode controller to control an inertia 


8 



wheel Pendulum. Although there are attempts to control IWP models using single control 
algorithm[28, 29], such control algorithms are not implementable in real time due to severe 
transient demands on the wheel actuation torque and the wheel speed. The dual mode control 
consists of two steps - swing up and balancing control. 

During swing up, the objective is to develop the oscillations that steadily increases in am¬ 
plitude so as to reach very near to top unstable equilibrium point. In other words, energy is 
pumped into the system during swing up phase to accelerate the system towards high ampli¬ 
tude oscillations. The energy that is being pumped in should be properly shaped so as to result 
in controlled oscillations for the pendulum to reach very close to the top unstable equilibrium 
point. 

Three different dual mode control strtegies have been disscussed, each consists of a swing 
up control and a balancing control. More emphasis is given to passivity based or energy based 
control. First.swing up control is based on partial feedback linearisation. Other two are energy 
based. The energy based swing up is achived by shaping pendulum system energy using two 
different representation of system dynamics. The first passivity based controller is designed 
using the system dynamics expressed in terms of Euler-Lagrange equations of motions. The 
other contol algorithm is designed using Hamiltonian system model. 

The objective of balancing controller is to balance the pendulum at top unstable equilib¬ 
rium point against disturbances. The first balancing control is a simple pole placement control 
by linearizing the system about unstable equilibrium point. The second one is based on input- 
state feedback linearization. And the third one is based on passivity approach in which the 
system is modeled as a Port Controlled Hamiltonian system with Dissipation (PCHD), the 
stabilization is achieved by shaping both kinetic and potential energy making use of intercon¬ 
nection of Hamiltonian structures and injecting damping into the system by negative passive 
output feedback. 

The controllers have been implemented in real-time on a Mechatronics control kit supplied 
by Quanser Inc. Experimental results are quite satisfactory. 



Chapter 2 

Inertia Wheel Pendulum (IWP) 


2.1 Introduction 

In this chapter, the Inertia Wheel Pendulum (IWP) model has been revisited which was first 
introduced by Spong[l]. It is one of the simplest non-linear underactuated systems with non- 
holonomic constraints ^ The Inertia Wheel Pendulum is a physical pendulum with a symmet¬ 
ric disk attached to the end which is free to spin about an axis parallel to the axis of rotation 
of pendulum. The disk is actuated by a DC-motor and the coupling torque generated by the 
angular acceleration of the disk is used to actively control the system. The Inertia Wheel Pen¬ 
dulum can be thought of as a simple pendulum in parallel with a torque controlled lumped 
inertia. Due to simlicity it can be used to illustrate a wide range of adyanced nonlinear control 
problems. 

2.2 The Model 

The Inertia Wheel Pendulum is a two degrees of freedom robot manipulator, of which pen¬ 
dulum constitutes the first link while rotating wheel is the second. The schematic diagram is 
shown in Figure 1.1, Since first link is provided with no actuator, the system is underactuated 
with one actuator at the rotating disk. The nonlinearity in system equations is due to the ap¬ 
plication of torque due to gravitational field. The equations of motion can be derived using 

‘Constraints which are not of the form2,4) = 0. They involve derivatives and double derivatives of 
the angular variables and Xq 2 and are not integrable. 
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the Euler-Lagrange (EL) formulation. The following system parameters are identified which 
are involved in the system dynamics. 

rUp Mass of Pendulum. 

rur Mass of wheel. 

m Combined mass of pendulum and wheel. 

Jp Moment of inertia of the pendulum about its center of mass. 

Jr Moment of inertia of wheel about its center of mass. 

Ip Distance from pivot to the center of mass of pendulum. 

Ir Distance from pivot to the center of mass of wheel. 

I Distance from pivot to the center of mass of pendulum and wheel. 

Angle of pendulum with the reference. 
ipr Angle of rotor with reference (as shown in figure2.1). 
r Motor torque input applied on disk (as shown in figure2.1). 



Figure 2.1: The Inertia Wheel Pendulum 
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The system parameters are redefined as follows: 


^ql — '4^p 
Xq2 = 'ipp+'4^r 
m — mp + nir 


ml = niplp + mrlr 

tj Jp “h TTiplp “t“ TD/j-l^ 


( 2 . 1 ) 


The kinetic energy K of the system is the sum of pendulum kinetic energy and rotor kinetic 
energy and can be written as 

1 - 1 . 

( 2 . 2 ) 




The potential energy of the system P is 


P = mgl{l — cosxgi) 


(2.3) 


where the potential energy energy is zero when pendulum is hanging in the downward posi¬ 
tion. Finally the Lagrangian function L is given by ' 


L = K-P 

1 1 

L = -Jxl^ +-Jrxl2-mgl{l-cosxgi) 


(2.4) 


The equation of motions are given using EL formulation as follows; 


dt \d'tp) dijj 


(2.5) 


where tp = [tpp, 'iprV'. Neglecting the frictional forces and electrical dynamics of the DC 
motor placed at the wheel, the torque vector is given as: 


0 

ktl 


( 2 . 6 ) 
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where kt is the torque constant of the motor and I is the motor current. The first element of 
the torque vector is zero since the pendulum link is not actuated. Simplifying equation 2.5, 
and using transformations given in equation 2.1, the final form of equation of motion is given 
as follows: 


Jxqi + mgl sin Xgi — —ktl 

JrXq2 = kj (2.7) 

The equatioAs (2.7) represent system dynamics when the reference, for pendulum angle and 
rotor angle is along the vertical downward axis as shown in Figure 2.1. Instead if the reference 
is taken along vertical upward axis, then kinetic energy of the system remains same, while the 
potential energy becomes P = mgl(cos Xqi — 1), which is zero at vertical upward position of 
the pendulum. In this case, the EL equations of motions take the following form: 

Jxqi — mglsinxqi = —ktl 

JrXq2 = hi ( 2 . 8 ) 


2.3 The Experimental Setup 

2.3.1 The Mechatronics Kit Hardware 

The experimental setup that we worked on is the Mechatronics Kit from Quanser. The Pen- 
dubot, the Furuta Pendulum and the Reaction Wheel Pendulum can be assembled using the 
components that are provided with the kit. The angular measurements are made using relative 
optical encoders. These optical encoders have a resolution of 4096 divisions per 27r radians. 
Therefore, the smallest angle that we can measure is 27r/4096. They measure each angle in 
24 bits. The actuating torque is given by a 24 V brushless DC motor. Details regarding the 
time constants, torques and currents can be found in the datasheet. 

The controller is implemented using TI’s (76711 Digital Signal Processor. The mother¬ 
board of the controller is TI’s C6711DSK. There is a daughter board on the motherboard 
which does the signal transmission required between the optical encoder and the controller on 
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the one hand and between the PWM card and the controller on the other. The time instants 
between the control actions has been programmed to be 5ms^. The daughter card can be used 
to print relevant control information on the LCD display. 

This Digital Signal Processor is a floating point processor, the architecture is designed in 
such a way that it works in real time - an essential prerequisite for a controller^. It uses thread¬ 
ing to accomplish the real-time capability. Of course, there is a limit to the computational load 
that the DSP can handle and still maintain its real-time performance. Further details can be 
found in the manuals accompanying the kit. 

2.3.2 The Software 

The CCStudio software which has been provided by TI for specific use with the DSP is used 
as a programming environment. The interface is very similar to the Visual Studio interface. It 
is a very powerful software environment allowing a whole range of real-time debugging tools. 

The software development cycle consists of three main tasks. First a C program is made 
which has all the necessary control actions to control the hardware and all the necessary com¬ 
munication actions to communicate with the PC. Then this is linked, compiled and assembled 
into the assembly language of the DSP'*. Finally it is downloaded into the DSP using a JTAG 
communication channel established through the parallel port. 

A Visual Basic plug-in is written so that the PC can communicate with the RTDX libraries 
functioning in the DSP. This also gives an opportunity to get make a good Graphical User 
Interface. 


2.3.3 System Parameters 

The system parameter values given in manual [30] have been used through out this work. 
These parameter values are as follows: 

nip = 0.2164 kg Jp = 0.0002233 kgm? Ip = 0.1173 m 

^Indeed, the following analysis should be digital in principle, but one uses continuous time analysis for ease 
of mathematical tractability and because it works. 

^There are libraries for Real Time Data Xchange(RTDX) 

‘‘Writing directly into the assembly language is a nontrivial task. 
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vTir = 0.0850 kg Jp = 0.00002495 kgm^ Ip = 0.1270 m 
m = 0.3014 J = 0.004572 Z = 0.12 m 

Up = = 8.85G rad/sec 

h = 0.0274 Nm/A Imax = 1-8 A 

where Up is the natural frequency of oscillation of pendulum. 

2.3.4 Some Considerations during Experimentation 

The optical encoders are relative. They have to be initialised in the zero position before 
running the controller. A function for this purpose has already been written and is in the file 
c6xdskdigio.c provided by the Quanser Consulting Inc. Directly we get the pendulum angle 
and rotor angle'^r from measurement. 

Another point of concern in the implementation of the control law is the measurement 
of disc velocity and the pendulum velocity. While it is judicious to design observers for this 
purpose, it is found that using a backward difference rule and then a first order filter is adequate 
in most control actions. 

The motor torque computed by the controller undergoes a scaling as it goes through the 
PWM card and the counters. The control variable (u) used in the computer is scaled so that 
10 units of u correspond to maximum current Imax- The following is the relation for control 
input scaling : 

kj = A:n; |u| < 10 
where proportionality constant k satisfies 

k = = 0.00493 

Taking u as control input the system dynamics can have following two forms: 

1. With vertical downward axis as the reference axis 


+ a sin x^i = —bpU 

X ~ bj-lt 


(2.9) 



2. And with vertical upward axis as the reference axis 


Xqi — a sin Xqi = —bpU 

iq2 = ( 2 . 10 ) 

where a = bp — j, and br = From specified system parameters these are calculated 
to be a = 78, bp = 1.08 and br = 198. 

Most importantly, there is a phase shift of tt between the control torque and the real torque. 
This is the most fundamental thing about this setup that one must take care of. 

2.4 Linearisation 

One of the most basic techniques in studying the stability of nonlinear systems around an 
equilibrium point is to linearise the system dynamics around that equilibrium point. If the 
eigenvalues of the state matrix of the linearised system are not on the imaginary axes, con¬ 
clusions about the stability of the nonlinear system around that particular equilibrium point 
can be drawn. In practice, most controllers for nonlinear systems whose region of operation 
is around the equilibrium point, are designed using linear control techniques on the linearised 
system. 

The following sections are on the analysis of the linearised system and the design of linear 
controller with state feedback. 

2.4.1 Analysis of Linearized Model 

In the absence of the control input the system equations reduce to, 

Xgi ■+■ asinxqi = 0 

Xg2 = 0 ( 2 . 11 ) 

The variable corresponding to the angular position of the disc does not appear in the La- 
grangian and is therefore, neglected in further analysis. By using the state variables y = 



[2/1) 2/2,2/3]^ = Xg2]^, the state equation is written as, 


y = f{y) 
r 

where /(y) = -asmyi 0 • 

The equilibrium points are the solutions of the equation /(y) = 0. It is clear that the 
solutions are (yi, y 2 , ys) = (n7r, 0, ys), n = 0, ±1, ±2.... The .solutions for even values of n 
correspond to the downward position of the pendulum and the solutions corresponding to the 
odd values of n are indicators of the inverted position. 

Linearising the pendulum dynamics about the downward equilibrium points we get, 

2/1 _ 0 1 yi 

y 2 J [ -a 0 J [ y 2 

Clearly, this is an elliptic(stable) equilibrium point. The orbits close to this equilibrium point 
in the phase-space are ellipses. Linearising the pendulum dynamics about the vertcal upward 
poistion (yi, y^) -> (yi + tt, y 2 ), we get 

2/1 _ 0 1 yi 

2/2 a 0 y 2 

This is a hyperbolic (unstable) equilibrium point. The orbits close to this equilibrium point 
are hyperbolas. The objective of linear control design problem is to stabilize the IWP at the 
vertical top unstable equilibrium point. 


2.4.2 The State Feedback Control Law for Linearized System 

The controllability of the linearized system is checked before designing a state feedback con¬ 
trol law. 

The linearised system is given in state space model as: 

y = Ay + bu 


where 


0 1 0 

A= a 0 0 , 

0 0 0 
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The condition for complete state controllability is rank{Q) = rank 6 ^6 A^b 
can be checked that this condition is satisfied because det{Q) ^ 0. 

Let us consider the state feedback control law 


= 3. It 


u = -kij/i - k 2 y 2 - hyz 

where ki, k 2 and k:^ are state feedback gains. The objective is to determine these feedback 
gains to meet desired specifications on system performance characteristics. During lineraized 
control, the initial system configuration is very near the top unstable unequilibrium position 
which can be specified by the vector y = {yi{0), 0,0)^. The objective of the state feedback 
controller is to drive the system from this initial state to final state given by y = [0,0,0]^. 
The relation between final state and initial state in terms of transfer function while including 
state feedback control law can be shown to be as follows: 




(s + brk3,){s + bpk-^ 


+ {brkz — hpk2)s^ — (a + bpki)s — abrkz 


1/1 ( 0 ) 


( 2 . 12 ) 


Since at the top equilibrium point all states must converge to zero, yi^ss = linis_o sl^s) = 
0, so also the disc angular velocity must converge to zero. For finding the feedback gains, 
following third order polynomial is selected 


(s^ + 2(us + w^){s + auj) 


with appropriate values of damping C, bandwidth u and the real pole at -aw. When compared 
with the following closed form characteristic polynomial, 

>l(s) = 5 ^ + {brkz — bpk2)s^ — (a + bpki)s — abrkz 

the state-feedback gains can be determined as follows; 


fci = 

k2 = 
kz = 


(a (1 + 2a^)w^ 

bp 

((a + 2()auj + au)^) 

abp 

abr 


(2-13) 
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In practise, the best course is to choose a damping C of around 0.7 and the value of a is fixed 
at 0.1. Then the value of lo is slowly varied from Up = 8.856 upwards untill a satisfactory 
system response is observed. The reason for choosing the third pole towards the origin rather 
than away from the origin is that the torque requirement in the latter case is larger. 

2.4.3 Simulation and Experimental Results 

Simulation and experimental results with state feedback control for linearized system are 
shown in Figures 2.2 and 2.3. The parameter value of uj is fixed at a; = 17 to obtain satis¬ 
factory response. The simulation results show that the state feedback controller for linearized 
system performs well as long as the pendulum angle jxgil < 0.12. This finding was also veri¬ 
fied during experiment. In fact this range is too small. Thus the obvious question arises if we 
can still employ other forms of linearized control to increase this range. What is the maximum 
range of the pendulum angle around top unstable equilibrium point for which the linearised 
control scheme can stabilize the system? The answer to this question will be investigated in 
the next chapter. 

However, let us take a close look at sumulation and experimental results obtained using 
this simple state feedback controller. The results are shown in figures 2.2 and 2.3. The plots 
of pendulum angle, angular velocity and wheel angle are compared in figure 1.2 while the 
plots of wheel angle velocity, control input and phase plot are compared in figure 1.3 between 
simulation and experiental outcomes. Following observations are made: 

1. The pendulum angle exactly converges to zero in simulation while this angle has a small 
offset in steady state during actual experiment. Since encoder senses position, the offset 
might have occured due to measurement offset. 

2. The pendulum angular velocity also converges to zero during simulation, however the 
same state variable appears to fluctuate around zero during experiment. It should be 
noted that the velocity variable is not directly sensed but is estimated from sensed posi¬ 
tion using optical encoder. The differentiation of position variable might have amplified 
measurement noise. 
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Smulanon Results 


Experimental Results 




(a) Pendulum Angle (b) Pendulum Angle 



(c) Pendulum Angular velocity 



s 6 7 8 9 10 


Turn tut itr) 

(d) pendulum Angular velocity 




(e) Wheel Angle (f) Wheel Angle 


Figure 2.2: Simulation and Experimental Results For Linearised Control (x^i ~ t, Xqi ~ 
t, Xq2 ~ t) 
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(a) Wheel Angular Velocity 


(b) Wheel Angular Velocity 



(c) Control Input 


(d) Control Input 



(e) Phase Plot 


(f) Phase Plot 


Figure 2.3: Simulation and Experimental Results For Linearised Control (Xg 2 ~ 

t, Xgl ~ Xgl) 
















3. Although the wheel angle converges to a constant value during simulation, the same 
variable slowly decreases during experiment. This effect can be observed as the disc is 
very slowly rotating during experiment. 

4. However wheel angular velocity follows same pattern during both simulation and ex¬ 
periment. ^ 

5. The control input is very smooth during simulation while it is fluctuating during ex¬ 
periment. However, the interesting aspect is that this fluctuation occurs approximately 
around the smooth values predicted by simulation. 

6. The phase plot during simulation is very smooth while during experiment, this plot has 
fluctuations around the smooth trajectory predicted by simulation. 

2.5 Concluding Remarks 

In this chapter, the system dynamics of the Inertia Wheel Pendulum are derived using EL for¬ 
mulations. The system parameter values as given in experimental kit manual[30] are taken for 
designing various controllers during thesis work. The bonafide nature of system parameters 
is established by comparing simulation and experimental results while implementing a simple 
state feedback controller. 


* * 
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Chapter 3 

A hybrid Control of IWP 


3.1 Introduction 

In this chapter a control approach for complete stabilization of Inertia Wheel Pendulum with 
a dual mode control is revisited. It is first shown that the system is locally feedback lin- 
earizable by a local diffeomorphism in state space and nonlinear feedback. We compare the 
feedback linearization control with a linear pole-placement control for the problem of balanc¬ 
ing the pendulum about the inverted position. For the swingup problem a controller based on 
collocated partial feedback linearization has been discussed. As basic concept of feedback 
. linearization some part of Differential geometry (see [2] for details) is covered in this chapter. 

3.2 Feedback Linearisation 

The goal of feedback linearisation is to linearise the system via feedback. It is difficult, given 
an arbitrary state equation, to guess a co-ordinate transformation so that the state equations 
become linear. In control theory, differential geometric tools have been used to find the con¬ 
ditions under which there is a local diffeomorphism*. These tools also provide the procedure 
in finding this diffeomorphism. In case this, diffeomorphism can be made global, the system 
is called globally feedback linearisable.A thorough exposition of this is given by the classic 
books by Isidori[3],[2]. 

'a function f : U -* V is a diffeomorphism if it is bijective and both / and /_i are continuous and have 
derivatives of all orders. 



In this chapter, it is shown that the reaction wheel pendulum is partially feedback lin- 
earizable in a certain domain of its state-space. Spong et al.[l] showed for the first time that 
the reaction wheel pendulum is partially feedback linearisable in a limited region of its state 
space. The analysis is an application of.the technique proposed in [1]. Then the simulations 
and experiments results are presented. 

3.2.1 Mathematical Preliminaries 

Before performing the feedback linearization, one requires to know some fundamental con¬ 
cepts of Differential Geometry. In this subsection some portion of differential geometry is 
covered. This includes Lie derivetives. Lie brackets, Diffeomorphism, State transformation 
and Frobenious theorem. 

Lie Derivatives and Lie Brackets 

Definition 3.1 Let h : —^'^be a smooth scalar function, and i : R” be a smooth 

vector function on R”, then the Lie Derivative of h with respect to f is a vector field defined 
by 

Lf h = V/i f 

Thus, the Lie derivative Lf h is simply the directional derivative of h along the direction of 
vector f. Repeated Lie derivative can be defined recursively as 

L^f h = h 

L\h = LfiLff^ h) = h) f 

Similarly, if g is another vector field, then scalar function L^Lf h(x) is 

LgLf h = V {Lf h) g 

Definition 3.2 Let f and g be two vector fields on R”. The Lie Bracket off and g is a third 
vector field defined by 





The Lie bracket [f, g] is commonly written as adf g. Repeated Lie brackets can then be 
defined recursively by 

ad? g = g 
ad} g = [f, adp^ g] 

Diffeomorphisms and State Transformations 

The concept of diffeomorphism can be viewed as a generalization of the familiar concept of 
coordinate transformation, according the definition stated in Slotine and Lee[4] 

Definition 3.3 A function $ ; M" E", defined in a region Q,, is called a diffeomorphism if 
it is smooth, and if its inverse exists and is smooth. 

If the region O is the whole space R”, then ^{x) is called a global diffeomorphism. if the 
function is defined for a finite neighbourhood of a given point, then 3>(a:) is called a local 
diffeomorphism. If the Jacobian matrix V# is non-singular in a subregion of 0 around any 
point, then $(x) defines a local deffeomorphism in the subregion. A diffeomorphism can be 
used to transform a nonlinear system into another nonlinear system in terms of a new set of 
states. Consider the dynamic system described by 

X = f{x) -1- g{x)u 

and let a new set of states be defined by 

2 = $(x) 

differentiation of z and use of dynamic equation leads to 

One can easily write the new state space representation as 

i: = fiz) + g(z) 

y = Hz) 

where x = (z), and the functions /, g, h are defined accordingly. 
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Frobenious Theorem 


The Frobenious theorem is an important tool in treatment of feedback linearisation for non¬ 
linear systems. It provides necessary and sufficient condition for solvability of a special class 
of partial differential equations. The statement of the theorem is given, after giving formal 
definitions of complete integrability and involutivity. Detail proof of the theorem is derived in 
[2] and [4]. 


Definition 3.4 A linearly independent set of vector fields {fi, f 2 ,..., fm} on K.” is said to be 
completely integrable if and only if there exists n—m scalar functions hi{x),h 2 {x),, hn-m (a;) 
satisfying the system of PDFs 

Vhi fj = 0 , (3.1) 


where — m, and the gradients V/ij are linearly independent. 

Note that with the number of vectors being m and the dimension of the associatedspace being 
n, the number of unknown scalar function hfs involved is n — m and the number of PDFs is 
m(n — m). 

Definition 3.5 A linearly independent set of vector fields {fi, f 2 ,..., f„i} on R" is said to be 
involutive if and only if, there are scalar functions aijk .' R" R such that 


m 

[fj, f,] (x) = ^ ayfc( 2 :)ffc(a:) Vi, j (3.2) 

fc=l 

Involutive means that if one forms the Lie bracket of any pair of vector fields from the set 
(fi, f 2 ,..., fin}, then the resulting vector field can be expressed as a linear combination of the 
original set of vector fields. Note that 

1. Constant vectorfields are always involutive, indeed the Lie bracket of two constant vec¬ 
tors is simply a zero vector, which can be trivially expressed as linear combination of 
vector fields. 


2. A set composed of single vector f is involutive. Indeed, 
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3. From Definition 3.5, checking whether a set of vector fields {fi, f 2 , • • •, fm} is involutive 
amount to checking whether 

rank (fj, £ 2 ,..., f„j) = rank (fj, £ 2 ,. -., fm, [fj, fj(2:)) 
for all X and all i, j. 

Now the Frobenious theorem can be stated [4] as 

Theorem 3.1 let 2 ,... ,frn be a set of linearly independent vector fields. The set is com¬ 
pletely integrable if and only if it is involutive. 

3.2.2 Feedback linearisation or Input-State Linearisation 

In this section input state linearization for single input nonlinear systems has been discussed, 
which can be represented by the state equations 

X = f(x-) + g(x')u (3.3) 

with f (■) and g(-) being smooth vector fields. The systems in this form are said to be linear 
in control or affine form. 

Definition 3.6 A single input nonlinear system in the affine form of (3.3), with f{x), g{x) 
being smooth vector fields on R”, is said to be input state lineaiizable or feedback linearisable 
if there exists a region Q in R", a diffeomorphism T : Q —* R”, and a nonlinear feedback 
control law 

u = a{x) + P{x)u (3.4) 

such that the new state variables z = T{x) and the new input v satisfy a linear time invariant 
relation 

z = Az3-bu (3.5) 


where 




0 10-0 
0 0 1-0 

A= . b = 

0 0 0-1 
0 0 0-0 

From above we get the equalities 

zi = 2:2 

i2 = -23 

Z 3 = Z 4 (3.6) 


Zn = U 

The new state 2 ; is called the linearizing state, and the control law (3.4) is called the lineariz¬ 
ing control law. The transformed linear dynamic has its A and b vector of a special form, 
corresponding to a linear companion form. 

Conditions for Input State Linearization 

At this point a natural question comes : Are all nonlinear state equations in the form of 
(3.3) feedback linearizable ? If not, what is the condition for such linearization exists ? The 
following theorem provides answers to these questions. Only statements of the theorem is 
given here. The detail proof is given in [4] and [2]. 

1 

Theorem 3.2 The nonlinear system in affine form (3.3), with f{x) and g(x) being smooth 
vector fields, is input state or feedback linearizable if and only if, there exists a region such 
that the following conditions hold : 

1 . the vector fields {g, adfg,..., adf”~’'g} are linearly independent in Q, 

2 . the set {g, ttdfg,..., adf‘~^g} is involutive in 
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The first condition can be interpreted as simply representing a controllability condition for 
nonlinear system (3.3) analogous to the controllability condition of a linear system. The vec¬ 
tor field [g, adfg,..., ad/^'^g] become [6, Ah ,and therefore their independence 
satisfies the invertibility of controllability matrix. So the first condition is a general control¬ 
lability condition for both linear and nonlinear systems. The second condition is trivially 
satisfied for for linear sytems (which have constant vector fields), but not generally satisfied 
in the nonlinezir case. 

How to Perform Input State or Feedback Linearization 

Based on previous discussion the feedback linearization of a nonlinears system of form (3.3) 
can be performed through following steps : 

1. Construct the vector field {g, adfg,..., o(if"~^g} for the given system. 

2. Check whether controllability and involutivity conditions are satisfied. 

3. If both are satisfied, from definition 3.6 we get, zi = Ti{x) where If denotes the z-th 
component of vector valued function T(a;).From the equalities (3.6) we get 


Zi — VTii = LfTi I/gTi — Z2 — T^ix) 
As T 2 {x) is independent of u, following is true 


T 2 {x) = LfTi and LgTi = G 
Repeating the reasoning fot Z 2 , z^,..., shows that 

LfTf = ri+i, IgTf = 0, /ori = l,...,n-l (3.7) 

Now let us show that the above equation on the Tf’s can be composed into a set of con¬ 
straint equations on Tj alone. To begin with lets consider the terra vri[f, g], equation 
(3.7) implies that 

vrjf, g] = V(IgT0f - V(LfTx)g = 0 - IgT2 = 0 
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By proceeding inductively we can show. 


VTiadfg = 0 for A: = 0,1, 2,..., n — 2 (3.8) 

Finally for i = n we have 

Zji — LfTn + uLgTn — ^ (3.9) 

Which says LgTn ^ 0 due to dependency of the above equation on input u, which gives 

VTiad^-^g^O (3.10) 

Hence 

-Z® = and ^ = I3{x) 

3.2.3 Feedback linearisation of IWP 

To perform the feedback linearization on Inertia Wheel Pendulum, consider the dynamic 
model of the form 


Jxgi = mgl sinxqi — ku 

Jj'Xg^ —" ku 


(3.11) 


where Xqi = '0p and Xq 2 — 'tpp+i’r, and V’r are pendulum angle and rotor angle respectively 
with reference to the vertical upward position of pendulum, u is the input to PWM driver and 
A: is a proportionality constant. Taking xi = Xgi, X 2 = Xqxandxz = Xqz the equation (3.11) 
takes the form 




X2 


0 


= 

a sin xi 

+ 

6p 

. . 


0 


. . 


(3.12) 
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where a = bp = ^ and br — As the rotor angle position does not have importance, we 
have omitted it from the equation (3.12). Equation (3.12) is in affine form of relative degree 


n = 3 with 



X2 


0 

fi^) = 

asinari 

and g{x) = 

bp 


0 


br 


Following steps are followed to perform feedback linearization and to find out the control 
input for Inertia Wheel pendulum system. 

STEP 1: For the reaction wheel pendulum, the vector fields {g, adf{g), adj(g)} are given by 


adf(g) 


adjig) 


abp cos Xi 
0 
0 

—abpX 2 sin Xi 
—abp cos Xi 
0 


STEP 2 : Since det{ f adf{g) cicPf{g) 
linearly independent in the region |xi| < |. 


) = —abrbpCos'^xi this set of vector fields is 
To test involutivity, observe that. 


[c/, adfig)] = ^^^g - 


0 

0 

0 


which is a linear combination of the original vector fields. 

STEP 3 : To find out the transformation we follow the step 3 said in last subsection, so 
conditions imply 



The simple choice Ti = hrX 2 + bpX^ satisfies the above three conditions. The other two 
components of the transformation can be computed as 


T 2 = LfT\ = a 6 r sin .Ti 
Tz = LfT2 = abrX2 cosxi 

Finally we get control input 

h> — o^br cos Xi sin Xi + ahrx\ sin Xi 

u — -=- 

abpb^ cos xi 

So defining new coordinate z = T(x) i.e. 


(3.13) 


Zi = brX2 + hpXz 

Z 2 '= abr sin Xi (3.14) 

23 = abrX 2 cos 


the system equations (3.11) takes the linear form of equation(3.5). 

3.2.4 Control Design 

As it is seen in the last section, the diffeomorphism T'(x) transforms the system equations to 
the following form: 


Z'l = Z2 
Z2 = Zz 
zz-y 

This is valid for |xi | = (Xgi| < 7 r/ 2 ^. Now the linear state feedback technique can be used to 
get the following control law, 


u = —kiZi — k2Z2 — kzzz 


(3.15) 


^Thi.s kind of feedback linearization is al.so applicable about the downward equilibrium position. 



where K = 


h 




kz 


T 

is called the feedback gain matrix. 


characteristic equation with the standard third order polynomial 


Again, one compares the 


(s + au))(s^ + 2 (ujs + w'^) 

to get the values of (ki, ^- 2 , ks) for the desired controller response. Note that, this guarantees 
(^i , Z 2 , Z 3 ) 0 which in turn can be seen to indicate (xi,X 2 , x^) 0 


3 . 2.5 Simulation and Experimental Results for Feedback r 

The simulation and experimental results are shown in figures 3.1 and 3.2. The parameter 
values of uj, , a, C are fixed at 12.0racf, 0.2 and 0.6 respectively. The simulation results 
shows that, lor pendulum angle and pendulum angular velocity within range |a;,l| < 0.12 and 
\Xgi < 2.0, the balancing controller works well, this is also observed with experiment. So the 

above ranges specify the domain of attraction of the vertical upward position with balancing 
controller. 

Now let us give a close look at the simulation and experimental results for balancing con¬ 
troller making use of feedback linearisation. Theplots of pendulum angle, pendulum angular 
velocity and wheel angle are compared in figure 3.1 while the plots of wheel angular velocity, 
control input and phase plot are compared in figure 3.2 between simulation and experimental 
results. The observations are summerised as below; 

1. The pendulum angle converges to zero in simulation while the angle has a small steady 
state offset during actual experiment. This offset is due to encoder measurement offset. 

2. The pendulum angular velocity obtained by both simulation and experiment converges 
to zero at steady state. But the pendulum angular velocity obtained from experiment has 
some oscillations. As the angular velocity is estimated from sensed angular position by 
optical encoder, the differentiation of position variable amplifies the measured noise, 
which appears as oscillations in pendulum angular velocity. 

3. In simulation the disc angle finally reaches to a steady state value, but in experiment 
due to presence of small roatation it continuously goes on increasing 



Simulation Resuln 


Experimental Results 



(a) Pendulum Angle Xqi 


(b) Pendulum Angle Xqi 



(c) Pendulum Angular Velocity Xqi (d) Pendulum Angular Velocity Xqi 



(e) Wheel Angle Xq2 (f) Wheel Angle Xq2 


Figure 3.1: Simulation and Experimental Results of Feedback Linearization (Xgi ~ t, Xqi ~ 

^g2 I’’) 
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4. The disk velocity decreases to zero in simulation, but some velocity remains in experi¬ 
mental results. 

5. The control input is very smooth in case of simulation, but in experiment it is too much 
noisy. As we are estimating the inpu from state variables, presence of noise the mea¬ 
sured and estimated states might be the cause of presence of noise in input. 

6. Phase plot obtained in experiment is nearly identical with the result obtained in simula¬ 
tion except a small disturbance is present. 

3.3 Partial Feedback linearization 

In this section a technique used by Spong[31] (to control the Acrobot) is used to control the 
Reaction Wheel Pendulum. The object of this strategy is to swing up the pendulum from any 
position to a point close to the inverted position. Note that it does not balance the pendulum 
at the inverted position. However, it is possible to switch to a balancing controller once it 
reaches into the region where balancing controller works.. 

3.3.1 Analysis 

The basic idea is to pump energy into the system by rotating the disk in phase with the motion 
of the pendulum so that the amplitude of the pendulum increases with each swing. This is 
called the collocated partial feedback linearisation strategy in [31] because the actuated joint 
is linearised. Consider the following equations of the reaction wheel jpendulum 

Xi — asinx'i = —bpU 

X2 = brU (3.16) 

Where the reference for pendulum angle and wheel angle tpr is the vertical downward axis. 
The second equation is just a double integrator and can be linearised by the folowing choice 
of control law. 


U = X2d- k2(X2 - X2d) - (X2 “ X2d) 


(3.17) 



where x^d is the reference trajectory. It was shown in [32] that an appropriate choice of the 
reference trajectory can be made so that the resulting zero dynamics^ are unstable. This causes 
the pendulum to swing to a position near the inverted poition. 

The choice of the reference trajectory which causes such a wonderful thing to happen is 

x-id = -kzcr{x{) (3.18) 

1 

where cr(-) is a sigmoidal function. A heuristic explanation of the reason for this based on 
energy arguments is given in [31], while a more rigorous explanation based on the analysis of 
the resulting zero dynamics is presented in [32], 

3.3.2 Control Design 

We have four design parameters, namely {ki,k 2 i kz) and the choice of the function ( 7 (-). Un¬ 
like in the case of the Acrobot where teh position of the second link is a matter of grave 
concern and the choice of a smooth sigmoidal function has its benefits, we opt for an abrupt 
nonlinearity, i.e. 

a{xi) = spn(xi) (3.19) 

The parameters (fci, kz) can be assigned for a good enough second order controller response. 
The values of ^3 and the speed of the controller response have to be determined by looking 
at the response of the real system. Roughly, one wishes to increase the energy transfer beteen 
the links so we wish that the second link reach its reference trajectory in a leisurely way. 

3.3.3 Simulation and Experimental Results of Complete Stabilization 

We combine this swingup routine with the linear balancing controller to present a hybrid con¬ 
troller strategy, which succeeds in swinging up th pendulum from the downward position and 
stabilising it in the inverted position. The choice of the angle where the switching takes place 
is determined experimentally to be 0.11 rad in either direction. For the partial fee back lin¬ 
earization controller, the parameter values of u and C are fixed at 15.0 and 0.7 respectively. 
^The evolution of the unlinearized link( the pendulum angle). 
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Simulation Results 


Experimental Results 



(a) Pendulum Angle Xqi 


(b) Pendulum Angle Xqi 



(c) Pendulum Angular Velocity Xqi (d) Pendulum Angular Velocity Xqi 



(e) Wheel Angle Xq2 (f) Wheel Angle Xq2 


Figure 3.3: Simulation and Experimental Results of Complete Stabilisation (Xgi ~ t, Xgi ~ 

ij Xq2 ^ 
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Simulation Results 


Experimental Results 



(a) Wheel Angular Velocity Xq2 (b) Wheel Angular Velocity Xq2 



(c) Control Input u 


(d) Control Input u 



(e) Phase Plotr^i Xqi (f) Phase Plot Xqi Xqi 

Figure 3.4: Simulation and Experimental Results of Complete Stabilisation (x^a ^ t, u 

t, Xgl ~ X 51 ) 
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From both simulation and experiment, it is determined that when pendulum angle reaches 
within the range of ±0.12 rad the pendulum angular velocity remain well within the range 
of ±2.0, hence switching over to balancing control stabilizes the pendulum at unstable equi¬ 
librium point. Simulation and experiment results for complete stabilization are compared in 
figure 3.3 and 3.4. Her we give a close look to the simulation and experimental results. 

1. The figure 3.3 compares the simulation and experimental results of pendulum angle. 
Pendulum angular velocity and wheel angle. 

2. It is observed from figure 3.3 that, in simulation the pendulum takes less time to reach 
vertical upward position from downward position, where as in experiment it takes a 
longer time. Perhaps this time difference is due to presence of friction which we are 
neglecting in simulation. In experiment we are injecting energy into the system to lift 
the pendulum up, if some part of it losses due to friction, then obviously it will take 
longer time than expected without friction. 

3. Figure 3.4 gives a comparison between simulation and experimental results for Wheel 
angular velocity, control input and phase plot. 

4. In simulation it is observed that the disc velocity finally goes to zero at stedy state, 
but in experiment there remains a small velocity. Possibly this is due to presence of 
small torque exerted by the dc motor supply wire. To neutralise the wire torque the disk 
rotates at a small speed. This is the cause of continuous increase in wheel angle. 

5. Control input responses for both simulation and experiment are almost identical during 
swing up of pendulum, but some amount of noise is present during stabilization. 

3.4 Concluding Remarks 

In this chapter we revisited the hybrid nonlinear control of a Inertia Wheel Pendulum to 
achieve both swingup and balancing. We investigate the balancing control strategy based 
on feedback linearization and swing up based on partial feedback linearization. We have 
shown that the system is locally feedback linearizable by a local diffeomorphism (i.e. in 
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region lar^il < |) in state space and nonlinear feedback. In practice the performance of 
feedback linearization control is comparable to approximate linearization and pole placement 
control disscussed in Chapter2 .The performance of the system is limited by finite actuator 
torque capability. The swingup control we discussed is based on collocated partial feedback 
linearization. 

:ic :ic s|« 
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Chapter 4 


Passivity Based Control of IWP 

4.1 Introduction 

Nonlinear systems has undergone tremendous development over the last three decades. Intro¬ 
duction of many techniques has greatly advanced the theory and has enabled the generalization 
of many fundamental concepts for linear systems to the nonlinear world.- In last decade the 
theory of passive systems has brought a revolution in this area. With this theory the latest 
developments on possibility of shaping the physical energy in such a way that it can be used 
as a suitable Lyapunov function for the control purpose has led to the control technique what 
is called Passivity-based Control. Passivity based control (PBC) is a well established structure 
preserving design methodology which has been shown to be very powerful to design robust 
controllers for physical systems described by Euler Lagrangian (EL) equations of motion. 
For regulation problems of mechanical systems it provides a natural procedure to shape the 
potential energy yielding controllers with a clear physical interpretation. 


4,2 Dissipativity and passivity 

The properties of dissipativity and passivity are very important in the analysis of EL systems. 
The theory presented is mostly based on passive systems. A passive system is a system that 
can not store more energy than the energy supplied to it. In other words, a passive system 



does not contain an energy source. Consider the non-linear system 


= fixit)) + g(x{t))u{t) 

y{t) = h{x(t)) + j{x{t))u{t) (4.1) 

where x E M.^,u,y E R'”, and the nonlinear functions, defined by are 

smooth. Let us call S{u{t), y{t)) the supply rate, such that \/u, Vx(0) 

/o*|S'(u(t),y(t))|d£< oo iER+ 

Definition 4.1 (Dissipativity) An EL system is dissipative with respect to the supply rate 
S{u, y) if and only if there exists a storage function H such that 

H(x{t)) < H{x(0)) + [ S{u{t),y{t))dt (4.2) 

Jo 

Vw andyx(0). 

This is called dissipation inequality, which means the storage energy function H{x{t)) at a 
future time t is not bigger than the sum of the available energy function H{x{0)) at an initial 
time 0 plus the total energy S{u{t), y(t))dt supplied to the system from external source in 
the interval [0, t], There is no internal creation of energy. 

Passive system represent an important subset of dissipative systems. 

Definition 4.2 (Passivity) A system is passive if it is dissipative with supply rate S{u, y) — 
u^y. Because the supplied energy is greater than or equal to the total energy H, the passivity 
property can also be written as 

H(x{T))-H{x{0))< / u^it)yit)dt (4.3) 

Jo. 

Vn(£), VT > 0 and all H{x{0)) 

Definition 4.3 The system given in equation 4.1 is passive if there exists a (locally) positive 
definite scalar function H, called a storage function, such that 

H < u^y (4.4) 
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Integration of equation 4.4 leads to same condition as given in definition 4.2 or equation 4.3. 


4.2.1 Passive systems and their properties 

Consider a mass-spring-damper system with equation of motion 

Mx + Bx -H Kx = u 


where M is the mass, B is the damping coefficient and K is spring constant. The total energy 
of the system is given as 


Then H satisfies 


H = 


H = Mxx 4- Kxx 

= XU — Bx^ < XU 


Therefore mass-spring-damper system is passive if one takes force u as the input and velocity 
X as the output (equation 4.4), but the system ceases to be passive if one takes position x as 
output since one can not verify the relation H < xu. Let us revisit some of the properties of 
passive systems: 

1. A linear system is passive if the transfer function that relates its output with input has 
relative degree one. 

2. A passive system can be easily stabilized by proportional feedback of passive output 
as shown in Fig (4.1). Since the system G is passive, the storage function H follows 
the relation H < xFy. Because of proportional feedback, setting u = —ky, the rate 
derivative of the storage function H satisfies the following relation: 

H<u'^y = -ky^y < 0 

for any positive value of gain k. This implies that ff is a decreasing function, i.e. 
the function H will continuously decrease until output y is not zero. Using LaSalle’s 
theorem [24], one can show that the system is asymptotically stable. 
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Figure 4.1: Output feedback passive system 


3. The proportional gain introduced for a passive system is normally referred as infinite 
gain margin. This is because the system is always stable even if the gain is very large 
as long as it is positive. 

4. If passive systems are connected in parallel, the resulting system is also passive as 
shown in Figure 4.2. With reference to figure 4.2, let Hi and H 2 are the stored energy 



Figure 4.2; Parallel interconnection systems 


function corresponding to respective subsystems Gi and G 2 . Let yi and t /2 are passive 
outputs of these two sub-systems respectively while u is the input to each subsystem. 
Since both systems are passive then following relation holds good: 

Hi < vFyi 
H2 < U^t/2 
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For the combined parallel interconnected system, the stored energy is iJi + i? 2 - Thus 
the following relation also holds good: 

=> Hi + H2 ^ ^^(2/1 + 2/2) 

Since 2/1 + 2/2 = y, the output of the combined system, the combined system is also 
passive. 

5. If passive systems are connected in feedback, then the resulting system is also passive 
as shown in figure 4.3. Let two systems Gi and G 2 are associated with stored energy 



Figure 4.3: Feedback interconnection systems 


functions Hi and H 2 respectively. The following conditions are true for individual 
systems: 

Hi < u^yi 

< {'f- y2Yy\ 

H2 < ylyi 

Since the stored energy function of the combined system is =: ffi + H 2 , one can easily 
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obtain following relation: 


Hi+H 2 < (r - y 2 )^yi + yiy 2 
< r^yi 

where r is the input and yi the output of the combined system. Thus it is clear that the 
feedback interconnection of passive systems is also passive. 


4.3 Passivity Based Control (PBC) of IWP 

The system dynamics of the Inertia Wheel Pendulum system as described in chapter 2 is 
reintroduced here: 


Jxqi + mgl sin Xqi — —ku 

JrXq2 = ku (4.5) 

where Xqi — 'tpp and Xq 2 — i^p+'ipr, i’p and are pendulum angle and rotor angle respectively 
measured from vertically downward axis. J and Jr are inertia parameters, u is the input to 
PWM driver and A: is a proportionality constant. 

The inertia wheel pendulum system can be thought of as an parallel interconnection of 
two subsystems i.e. pendulum subsystem and wheel subsystem with input u as shown in 
Figure (4.4). Then the control problem is to determine u — f(yi,y 2 ) such that the parallel 
interconnection system as shown in figure 4.4 is passive. 

Before one can determine the control law u = f(yi, 2 / 2 ), one needs to establish that each 
subsystem shown in figure 4.4 is passive. Consider the subsystem wheel whose transfer func¬ 
tion is The energy storage function Hi for the wheel is very simple to find out 

which is Hi = ^Jril 2 - The rate derivative of this storage function can be simplified using the 
relation JrXq 2 = ku given in equation 4.5: 

Hi = JrXq2Xq2 = kuXq2 (4.6) 
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Figure 4.4: IWP system as a parallel interconnection 


If one selects Xq 2 as the output yi of the wheel sub-system, then this subsystem is passive as 
shown in equation 4.6. 

Next consider the second subsystem which is the pendulum system. The general form of 
energy function of this subsystem is given as follows: 

, E = + mgl{l - cosrcgi) (4.7) 

The rate derivative of this pendulum energy follows: 

E = JxqiXgi+mgl sin {xqi)xqi 
= {Jxqi H- mgl sin (x 5 i))xgi 

= -kuxqi (4.8) 

It can be seen in equation 4.8 that the energy is constant along the state trajectories if u = 0. 
In addition, the pendulum subsystem is passive from input —ku to Xgi, with the energy E as 
the storage function. However instead of taking the energy E as the stored energy function, 
the stored energy fumction is taken as follows, the motivation for which will be clear shortly: 

(4.9) 

where Eref is a (constant) reference energy. The rate derivative of this new stored energy 
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function yields following relation: 


H2 = {E-Eref)E 

= -kXqi{E - Eref)'iJ^ 

= y2U (4.10) 

where the output function is taken as 2/2 = —kd:qi(E — Eref)- The reason for such choice of 
y 2 and H 2 will be explained shortly. Since the pendulum system is passive as per equation 
4.10 from input u to output y 2 , the parallel interconnection system of pendulum and wheel is 
also passive. 

It is verified that the combined wheel and pendulum system is passive from input u to 
output yi + 2/2 using property (4) in section 4.2.1. Instead of taking the ouput of combined 
system to be yi+y2, it is taken as y = k^yi + key2 to add extra degree of freedom to control 
design through proper selection of parameters ky and k^.. The modified storage function for 
the combined system thus becomes; 

H = kyHl+k^H2 

= \kyJr^q2^\ke{E-Er,ff (4.11) 

Computing H along the state trajectories of the combined system yields: 

H = {kykXq 2 - kek{E - Eref)xqi)u = yu ' (4.12) 


If one selects control input as follows: 

'll ““ "“kyy — '~ky(kykxq2 kgki^E “■ E^^j^Xq^ 
then equation 4.12 is further simplified as: 

H = -A:„y2 < 0 


(4.13) 


(4.14) 
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Thus the system is stable as per equation 4.14 using Lasalle’s invariance theorem, and now 
we can determine the asymptotic behavior of the system making use of LaSalle’s invariance 
principle. Setting y = 0 to make H = Q yields 

U = ku{kek(E - Eref)iql - kykXg2) = 0 (4.15) 

It follows that the time derivative m = 0 from which we get 


ky,(kekEXqi + kek(E — Eref)Xqi — kykXg^) = 0 


Since u = 0, Xg 2 = E = —kuxqi, and x,,i = — ^sinx^i — ku, the above equation 
reduces to 

{E — Eref) sin Xgi = 0 (4.16) 

This equation is very important and says that the closed loop trajectory will converge to either 
E = Eref or sina:,! = 0. From the first case E = Ere/ in addition to equation (4.13) we 
get Xq 2 = 0. In second case, it follows that Xgi = nn and Xq 2 = constant. One thing to be 
noted down here, this PBC only swings up the pendulum up to vertical upward position but 
not stabilizes there, because the controller brings the trajectory into a homoclinic orbit which 
intersects both stable and unstable equilibrium points. 

An extremely important advantage of the PBC is that the analysis remain unchanged if, 
instead of using control law defined by (4.11), we use saturated control 


u = sat[—ky,y) = sat{—ky{kykxq 2 - kek{E — Eref)iqi)) (4.17) 

where jSai(-) is a saturation function. Then tFe storage function H satisfies the condition 

H = -y sat{kyy) < 0 


• Hence the system is stable with saturation control. 



4.3.1 Effect of friction on Controller 


The friction has a significant effect on the performance of the controller which can be seen 
from simulation and experimental results in Figure 4.5 and 4.6. Considering the friction at 
pendulum hinge point and neglecting the wheel friction (as it can be compensated by adding 
equivalent amount of motor input) the equations 4.5 becomes 

Jxqi + hxqi + mgl sin x^i = —ku 

JrXq2 = ku (4.18) 

where b is the coefficient of rotational friction at the hinge point of pendulum. We keep the 
energy function H, output y and input u functions of subsystems same as before. As the stored 
energy of the whole system is H = ^kyJrX^^ + \ke{E — Erej)^, then H along the trajectory 
is 


H = {kykXq2 - kek{E - Ere/)x^l)u “ b{E - Eref)ili 
- yu- b{E - 

= -kuy^-b{E-Eref)x% (4.19) 


Now we can analyse the effect of friction by superimposing it on the ideal model. The stored 
energy decreases or remain constant if H < 0, except at the vertical upward position the term 
—b(E — Eref)Xgi is always positive (as E < Erej and Xgi is non zero), and the term —k^y^ is 
negative (Except at the vertically upward position of pendulum where it is zero) and gradually 
approaches zero as the pendulum approaches vertical upward position, so there is some region 
around the vertical upward position in which the rate of stored energy H is positive. Hence the 
trajectory never goes into that region. To make the region small i.e. to increase the amlitude 
of oscillation we take the reference energy Eref a bit higher value than the actual value, by 
which the control input at each instant increses due to increase in difference B Eref- 

5PT‘rfV;TT«r f 

A... 



4.4 Simulation and Experimental Results of Passivity Based 


Swing up 

The Figures 4.5 and 4.6 show the simulation and experimental results of passivity based swing 
up control on inertia wheel pendulum. In simulation results shows the responses without and 
with pendulum friction. We set the reference energy , Eref equal to the potential energy of the 
system at vertical upward position which is Eref = 2mgL The system parameters are given 
in the chapter 2. The gains ke, ky and ku are chosen to be, 

ke = 4000, ky = 4, ky, = 0.4 

Figure 4.5 gives a comparison between simulation results and experimental results of pendu¬ 
lum angle, pendulum velocity and wheel angle, where as the Figure 4.6 gives a comparison 
between simulation and experimental results of whell angular velocity, control input and pen¬ 
dulum energy. Now lets have a close observation of simulation and experimental results. 

1. As the simulation result shows, the pendulum angle oscillates between ±7r rad if we 
neglect pendulum friction, where as in the presence of friction it oscillates between 
±2.74 rad. The experimental result of pendulum angle exactly follows the response of 
simulation result with friction. 

2. The maximum value of pendulum angular velocity for both simulation and experimental 
results are almost same. 

3. As we observe from simulation result the wheel angle goes to zero as soon as the pen¬ 
dulum achieves its peak amplitude neglecting pendulum friction. Where as in presence 
of friction it oscillates between a some value. But the experimental results shows that 
the wheel angle goes on increasing, this is due to presence of a small positive average 
wheel angular velocity. 

4. The wheel angle converges to zero during steady state if we neglect pendulum friction, 
where as it oscillates about zero with some amplitude in presence of friction. Experi¬ 
mental result also shows the small oscillations in wheel angular velocity. 
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Simulation Results 


Experimental Results 



(a) Pendulum Angle Xqi (b) Pendulum Angle Xq\ 




(c) Pendulum Angular Velocity Xqi (d) Pendulum Angular velocity Xq\ 




(e) Wheel Angle Xq2 (f) Wheel Angle Xq2 


Figure 4.5: Simulation and Experimental Results of Passivity based swing up (xgi ~ t, Xqi ~ 

Xq2 ~ t) 
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Simulation Results 


Experimental Results 



(a) Wheel Angular Velocity Xq2 (b) Wheel Angular velocity Xq2 
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(e) Pendulum Energy E 


(f) Pendulum Energy E 


Figure 4.6: Simulation and Experimental results of Passivity Based swing up (i ,2 t, u 
t, E 
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5. The control input ceases to zero as soon as pendulum oscillation achives stedy state 
value if we neglect pendulum friction, where as it oscillates about zero with some am¬ 
plitude in presence of friction. It means energy is being supplied into the system to 
compensate the frictional losses with oscillation of pendulum. 

6. In presence of friction there is a small steady state error in pendulum energy, where as 
the energy exactly converges to Eref in case friction is neglected. 

Figmre (4.5(a)-4.5(e)) shows the simulation results without and with friction. The peak 
amplitude of swing up finally reaches to ±7r and but with friction, it goes up to ±2.74rad 
(fig 4.5(a)) without friction. In absence of friction, wheel velocity goes to zero when the 
peak amplitude of swing up is reached. In presence of friction i oscillates with amplitude 
of ±100rad/sec (fig 4.5(c)), To maintain this wheel velocity, torque also oscillates within 
a range (fig 4.5(d)). The stored energy never reaches 2mgl due to the presence of friction 
(fig 4.5(e)). Figure(4.5(f)) shows the phase plot of pendulum angle with pendulum angular 
velocity. 


4.5 Experimental Results for Complete stabilisation 

We are using a Linear State Feedback (LSF) controller to stabilize it at the vertical upward 
position. The responses with combined controller are shown in Figure 4.7. The balancing 
controller works within a band of ±0.12 radian about the vertical upright position. So we 
need to bring the pendulum by swing up controller into a region of Xqi > ±3.02159 radian 
from which we switch over to Linear State Feedback (LSF) controller. But due to presence of 
friction the pendulum does not enter into the region. So we intentionally add a small amount 
of constant energy term to Eref to make the region small in which H is positive. Hence the 
amplitude of pendulum oscillation increases to the desired value from which we can switch 
over to LSF control. Figure4.7(f) shows the phase plot of pendulum angle with pendulum 
angular velocity with the combined swing up and balancing control 
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(kmd^sec) 


(a) Pendulum Angle Xqi 


(b) Pendulum Angular Velocity Xgi 




(e) Pendulum Energy (f) Phase Plot ~ Xgi 


Figure 4.7: Experimental results of Passivity Based swing up with LSF balancing 
COntrolCx,! ~ t, Xgi ~ t, Xq 2 ~ ig2 ~ u ~ ~ t) 

















4.6 Concluding Remarks 


In this chapter the application of non-linear control theory to the problem of swingup is in¬ 
vestigated, the passivity based swing up control brings the pendulum from vertical downward 
position to a close region of vertical upward position. Experimental results shows that the 
swingup based on passivity responds quite faster compared to the Partial Feedback linearisa¬ 
tion method described in chapter 2. Using passivity together with La Salle’s invariance prin¬ 
ciple allowed us to analyse complete behavior of the closed loop system. With this method 
we can only swing up the pendulum to vertical upward position but to stabilize at the top we 
need a balancing controller. 

* * * 
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Chapter 5 

IDA-PBC Control of IWP 


5.1 Introduction 

The Passivity-based Control introduced by Ortega and Spong [33] defines a controller de¬ 
sign methodology which achieves stabilization by passivation. More precisely, the control 
objective is to passivize the system with a storage function which has a minimum at desired 
equilibrium point. The method is very successful for the simple mechanical systems that can 
can be stabilized shaping only the potential energy. In this case the closed loop system is still 
an EL system (as described in [34]) with the total energy being the difference between the 
stored and the supplied energies, hence the stabilization can be explained in terms of physi¬ 
cal energy balancing. But in particular we are interested in system Inertia Wheel Pendulum, 
which requires shaping of total energy of the system for stabilization. The draw back of 
PBC approach for these type of systems is that the closed loop storage function (typically a 
quadratic function as chosen in equation 4.11 of chapter 4) are not the energy function in any 
meaningful physical sense. Although the closed loop system is passive but the closed loop 
system no longer remains an EL system. 

In this chapter a new PBC theory called Interconnection and damping assignment (IDA) 
PBC [35] [29] which extends to a broader class of systems e.g. to applications that require 
shaping of the total energy. The special features of the method is that [36] (i) the physical 
stmcture (Hamiltonian) is preserved in closed loop, and (ii) the storage function of the passive 
map is precisely the total energy of the closed loop system. Some conditions (discussed later) 
on dissipation shows that total energy is the difference between the stored and the supplied 
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energies. 

The main distinguishing feature between the classical PBC and IDA-PBC is that in former 
we select storage function to be assigned and then design the controller that ensures the objec¬ 
tive (Le. by rendering the energy function non-increasing). On the other hand, in IDA-PBC the 
closed loop energy function is obtained by the solution of partial differential equation (PDE) 
as a result of our choice of desired subsystems interconnections and damping. Though it is 
not easy to solve PDEs in general but, for inertia wheel pendulum the chosen interconnection 
and damping matrices with some explicit conditions makes it easy to get the solutions. 


5.2 Port Controlled Hamiltonian systems 

For the above we depart from the EL description of the system and consider Port-controlled 
Hamiltonian (PCH) models [37] , which is a very large class of non-linear systems strictly 
containing the class of EL models. They result from the modeling of energy conserving 
lumped parameter physical systems with independent storage elements. Besides capturing 
the energy balance features of physical systems as in EL models, PCH models provides a 
classification of the variables (displacement and momentum) and the equations into those 
associated to phenomenological properties and those defining the interconnection structure 
related with the exchange of energy. They are therefore well suited to carry out the basic steps 
of PBC to shape energy function and adding dissipation. 

5.2.1 Port-controlled Hamiltonian model from EL and Hamiltonian model 

Let us recall the standard EL and Hamiltonian equation of motion. The standard EL equations 
are given as 

where q = (qi,..., q^)^ are generalized configuration coordinates for the system with k de¬ 
grees of freedom, the Lagrangian L equals the difference K — P between kinetic energy K 
and potential energy P, and r = (ti, ..., r*)^ is the generalized forces acting on the system. 
Furthermore and ^ are the column vectors of partial derivatives of Z<(q, q) with respect 
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to velocities qi, .,.,qk and positions qi,—,qh respectively. In standard mechanical systems the 
kinetic energy K is of the form 


K{q,q) = \fM{q)q 


(5.2) 


where ky-k inertia (generalized mass) matrix M{q) is symmetric and positive definite for all 
q. In this case the vector of generalized momenta p= {pi, —iPkY, defined for any Lagrangian 
L as p= is simply given by 

p = M{q)q (5.3) 

and by defining state vector (gi,..., qk,Pi, —iPkV (^-^ ^ second order equations of (5.1) 

transform into 2k first order equations as, 

riTT 

q = {=m~\q)p) 


where 

Hiq,p) = ^P^M-'^{q)p + P{q) (= ^fM{q)q + P{q)) (5.5) 

is the total energy of the system. The equations (5.4) are called the Hamiltonian equations 
of motion, and H is called Hamiltonian. The following energy balance immediately follows 
from (5.4): 

d d'^H. d'^H, .r 

which implies that the increase in energy of the system is equal to the supplied work {conser¬ 
vation of energy). 

It should be noted that if H is considered as the storage function, then equation 5.6 implies 
that the system given by equation 5.4 is passive from input u = r to output y = q. However, 
the stored energy function H can be negative since it consists of the potential energy term 
P{q) which can be also negative. The potential energy is always computed based on some 
reference level. This energy is usually positive above reference and negative below reference. 
If one can find the minimum potential energy of a system, then it is always possible to define 
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a positive terra for the potential energy. Let the lower bound on the negative value of the 
potential energy is C. If we redefine the Harailotian as: 

H{q,p) = + P{q) - C (5.7) 

then the Harailonian H can be treated as the stored energy function since its value is always 
positive. 

The system dynamics given by equation 5.4 can be written in more general form as 


q = 

P = 

y ' = 


(9>p) = (qi,-,qk,Pi,-,Pk) 
~{q,p) + B(q)u, ueW- 

B^iq)^(q,p) (= B^m, y € 


(5.8) 


Here B(q) is the input force matrix, with B{q)u representing the generalized forces resulting 
from the control inputs u E K"*. The state space of (5.8) with local coordinates (q,p) is 
usually called the phase space. And when m < k, the system becomes an under-actuated 
system. Because of the form of output equations y = B'^[q)q, the energy balance equation 
takes the following shape: 

^(,(t),p(i))=«^(t)!/(i) (5.9) 

If H is defined as given in equation 5.7, the Hamiltonian system given by equation 5.8 is a 
lossless state space system or a passive system. 

A major generalization of the class of Hamiltonian systems is to consider systems which 
are described in local coordinates as 


X = J{x)-^{x) + g{x)u, X € X, u e R"" 
ox 

V = 

Here J(x) is an n x n (we changed 2kx2ktonxnto make it convenient) matrix with entries 
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depending smoothly on x, which is assumed to be skew-symmetric, i.e. 


J{x) = —J^{x), (5.11) 

and X = {xi,..., Xn) are local coordinates for an n-dimensional state space manifold Be¬ 
cause ofequation(5.11), one can easily recover energy balance equation ^(x(t)) = u'^(t)y(t), 
showing that the system dynamics given by equation (5.10) is also lossless if JI > 0. The sys¬ 
tem dynamics in equation (5.10) with J satisfying equation (5.11) is called a Port-controlled 
Hamiltonian (PCH) system [37] with structure matrix J{x) and Hamiltonian H. 

5 . 2.2 Basic properties of Port-controlled Hamiltonian systems 

Port-controlled Hamiltonian (PCH) systems represented by equations (5.10) and (5.11) not 
only can be seen as generalization of classical Hamiltonian equations of motion, but they 
naturally arise from modeling of physical systems without dissipative elements. As described 
in equation (5.10), a port-controlled Hamiltonian system is defined by a state space manifold 
X with (J, g, H). The pair {J{x), g(x)),x € captures the interconnection structure of the 
system, with g{x) modeling in particular the ports of the system. Independently from the 
interconnection structure, the function H : R defines the total energy of the system. 

PCH systems have following properties: 

1. PCH systems are intrinsically modular i.e. parallel and feedback interconnection of 
a number of PCH systems again defines a PCH system, with overall interconnection 
structure determined by the interconnection structures of the composing individual PCH 
systems together with the interconnection between the systems. 

2. Hamiltonian for overall system consisting of interconnected PCH systems is just the 
sum of individual Hamiltonians. 

3. From the structure matrix J{x) of a port-controlled Hamiltonian system one can di- 
fectly extract useful information about the dynamical properties of the system. Since 
the structure matrix is directly related to the modeling of the system, this information 
usually has a direct physical interpretation. 
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Port-controlled Hamiltonian systems with dissipation 

The system dynamics given by equation (5.10) can be made dissipative by adding a term R{x) 
as follows: 


X = [J{x)-R{x)]—(x)+g{x)u 

V = (5.12) 


where R{x) is a positive semi-definite symmetric matrix, depending smoothly on x. In this 
case the energy balancing property of equation (5.8) takes the form 


dH 

dt 


{x(t)) = 


^^{x{t))Rix{t))^{x{t)) < u^{t)yit) 


(5.13) 


Thus the system becomes passive if the Hamiltonian H is given as per equation (5.7). The sys¬ 
tem dynamics given by (5.12) is called a port-controlled Hamiltonian system with dissipation 
(PCHD) [37]. In this case, the system consists of two geometric structures - (i) the internal 
power conserving interconnection structure given by J{x), and (ii) an additional dissipative 
stmcture given by R{x). 


53 Interconnection and Damping Assignment PBC 

The PCHD system dynamics (5.13)) forms a more general model of which a PCH system is a 
special case, i.e when dissipative structure matrix R{x) = 0. 

The energy balance equation (5.13) can be written in a different form by integrating both 
sides of this equation: 


/ 

Jo 


u^{s)y{s)ds = H{x{t)) — H(x(0)) 


supplied 


stored 


I 




ds 


(5.14) 


^. 

dissipated 
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which holds for all t > 0. If the energy function is bounded from below PCH systems define 
a passive operator from u ^ y with storage function H{x). In this case equation (5.14) 
expresses the fact that a passive system can not store more energy than that is supplied to it 
from the outside, with difference being the dissipated energy. Following properties of PCHD 
systems can be easily verified if the system dynamics are given as per equation (5.14): 


1. One can only extract a finite amount of energy from a passive system: Since the terms 
H(x{t) and j^^^(a:(5))i2(x(s))^(a;(s)) ds are always positive, it can be shown 
from (5.14)) that the following relation holds good: 


/ 


vF(s)y(s)ds < H(x{0)) 


(5.15) 


which shows that one can only extract a finite amount of energy from a passive system. 

2. The uncontrolled PCHD system will eventually converge to the state corresponding to 
minimum energy if the potential energy function is non-negative: When u{t) = 0 in 
equation (5.14), the relation H{x{t)) < H{x{Q)) holds good. This implies that the 
energy of the uncontrolled system is non-increasing. In fact, this energy will eventu¬ 
ally decrease in presence of dissipation until the system trajectories reach the point of 
minimum energy. 


3. The controlled PCHD with negative output feedback control converges to equilibrium 
point faster compared to the uncontrolled PCHD: Let the negative output feedback 
control be u = — Equation (5.12) thus becomes: 


OH, 


X = [J(x) - {R{x) + g{x)k^g^{x))] 


(5.16) 


If K = > 0, i.e. is a symmetric positive definite matrix, then the value of 

dissipative term is increasing. Due to increase in dissipative term, the rate derivative of 
the energy function, will decrease faster compared to uncontrolled PCHD. Thus the 
system will converge to equilibrium point faster. 

4. The PCHD system with negative output feedback can exhibit equilibrium points of in- 
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terest: It is clear that an open loop PCHD system has its own equilibrium points that 
may not be of interests to a control engineer. However, introduction of suitable negative 
feedback can take the closed loop system to any desired equilibrium point. 


5.4 The Control Problem in the Context of Passivity 


Given the PCHD system (5.12) and a desired fixed equilibrium point a:*, it is desired to find 
out a control action^ u = ^{x) + v such that the closed loop dynamics is a PCHD system 
satisfying the new energy balancing equation. 


- Hd{x{0)) = [ [u^(s)y'(s)]ds - dd{t) 
Jo 


(5.17) 


where Hd{x) is the desired closed loop stored energy function, which has a strict (local) mini¬ 
mum at a:*^. y' (which may be equal to y) is the new passive output, and the dissipation term is 
represented by some function dd{t) > 0, which is necessary to increase the convergence rate 
for the system to reach the equilibrium point x*. In equation (5.17) the term Hd{x) becomes 
zero or minimum when u = 0, and dd{t) = Hd{x{0)). This implies that as the dissipative 
term increases along the system trajectory, at x = x*, the closed loop stored energy function 
becomes zero due to the fact that dd{t) = Hdix(0)) at this desired equilibrium point. 

One of our main concerns is to understand the nature of the stabilization mechanism, and 
in particular what is the role of dissipation. We will give conditions under which the energy 
function of the closed-loop PCHD system is the difference between the stored and the supplied 
energy i.e. Hd{x{t)) = H(x{t)) - u{s)y{s)ds. In this case we will say that stabilization is 
achieved via energy-balancing. 

'Only the case of static state-feedback is considered in this work 

^That is, there exists an open neighborhood ^ of x» such that Hd{x) > Hd{x») for all i € ^ 
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5.5 IDA-PB Controller design 


The objective of a IDA-PBC is to find a static state-feedback control u = ^{x) (u = 0 is 
taken) such that the closed-loop dynamics is a PCH system with dissipation of the form, 

x = [Ux)-R,{x)]^{x), (5.18) 

where the new energy function Hd{x) has a strict minimum at the desired equilibrium x*, 
Jd{x) = —Jj(x) (skew symmetricity) and Rdix) = i?J(a:) > 0 (symmetric positive definite). 
The following method gives the theoretical part, which allows to effectively construct the 
controller. 


Design procedure 

Proposition 5.1 [35, 29] Given are functions J{x), Rix), H{x), g{x) and the desired equi¬ 
librium to be stabilized at x„. Assume that one canfirulfunctions p(x),Ra{x), Ja{x) such that 
Jd{x) ^ J{x) -b Ja{x) = -J^ix), Rd{x) 4 R{x) -f- Ra{x) = R^^{x) > 0, 
and a vector function K(x) : R” —* R” satisfying 


[Jd{x) - Rd{x)]K{x) = -[Ja{x) - i?a(x)]—(a;) -f g{x)f{x) 

and such that 

(i) (Integrability) K (x) is the gradient of scalar function. That is, 


dx 


{x) = 



{ii) (Equilibrium assignment) K{x), at a:*, verifies 


(5.19) 


(5.20) 


K(x.) = 



(5.21) 
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{Hi) (Lyapunov stability) The Jacobian ofK{x), at x*, satisfies the bound 


\ L/ JLJ. , ^ 

Under these conditions, the system (equation (5. 12)) in closed loop with u = fi{x) 4- v will be 
a PCH system with dissipation of the form (equation (5.18)), with new energy function 

Hd{x) ^ H{x) + Ha{x) (5.23) 


and 

riTT 

-^(x) ^K{x). (5.24) 

Furthermore, ifv(t) = 0, x* will be a (locally) stable equilibrium of the closed-loop. It will be 
asymptotically stable if, in addition, the largest invariant set under the closed-loop dynamics 
contained in 

|x e R”| 

equals {x»}. An estimate of its domain of attraction is given by the largest bounded level set 
{x e R" I Hd{x) < c} for some positive constant c .. 


dHd 


dx 


{X) 


Rd{tc) 


dHd 

dx 


(x) = o| 


(5.25) 


Proof: 

For every given /?(x), Ja{x)-, Ra{tc), (and on any contractible neighborhood of R”), the solu¬ 
tion of equation (5.19) is a gradient of the form (5.24) if and only if the integrability condition 
given in equation (5.20) is satisfied. Using equation (5.23) it is easy to see that, in this case, 
the closed-loop is a PCH system of the form (5.18) and total energy can be given as (5.23). We 
will now prove that, under (5.21), (5.22), the stability of the equilibrium is ensured. Notice 
that the equilibrium assignment condition (5.21) ensures Hd{x) has an extremum at x*, while 
the Lyapunov stability condition (5.22) shows that it is actually an isolated minim um. On the 
other hand, from (5.17) (with v = 0 and a suitably defined dd(t)) we have that, along the 
trajectories of the closed-loop, Hci{x{t)) is non-increasing, hence it qualifies as a Lyapunov 
function, and we can conclude that x* is a stable equilibrium. Asymptotic stability follows 
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immediately invoking La Salles invariance principle [24] [3] i.e. every solution starting from a 
set {x\Hd(3:) < 0} finally goes to the largest invariant set contained in the set {x\Hd{x) = 0}. 
Due to skew-symmetric property of Jd{x), [^{x)f Jd[x) [^(x)] = 0. Hence the con¬ 
dition (5.25). Finally, to ensure the solutions remain bounded, we give the estimate of the 
domain of attraction [2]. 


Discussion 

1. As given in [38] for the system of the form (5.12) we can fix Jo(a:) and Ra{x) and then 
look for the solution of the PDE 


g\x) [J(i) + Ux) - {R{x) + ^{x) 

f) ff 

= 

intermsof ifa(a:), where ^■‘■(a:) is aleftannhilatorofp(a;), i.e. g-^(x)g{x) = 0. This is a 
linear PDE of the form A{x) {x) = b{x). Then the control can be directly calculated 
using the formula 


0{x) = [g‘^{x)g{x)] ^p^(x){[J(x) + Ja{x) - (i?(x) -f Ra{x))] 

riff 

-f-[Ja(x)-i?a(x)]^(x)} 


2. For the general case, we can fix Ja{tc), Ra{^), then solve the algebraic equation (5.19) 
for K{x), which is trivial if the matrix [J(x) -f- Jaix) — [R{x) + f(a(3j)]] is invertible. 
Finally the integability condition (5.20) define a PDE directly for the control P(x). 

3. If there is no clear physical considerations for the choice of Jaix), Ra{x), we can simply 
select them to simplify the solution of the PDE. This procedure is adopted for inertia 
wheel pendulum to stabilise it about vertical upward position.. 
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5.5.1 Application to mechanical systems 

In this section, it will be shown how IDA-PBC can be applied to mechanical systems. Let the 
mechanical system under consideration has a total energy: 

H{x) = ]^X^M~''-{Xq)Xp+U{Xq) ' (5.26) 

where the state space has been partitioned into position and momenta components as x = 
[Xq, Xp ]^. The inertial matrix M{xq) satisfies the proprty of symmetric positive definiteness, 
i.e. M{xq) = M^{xq) > 0. The potential energy U{xq) is made non-negative as given in 
equation (5.7). The PCHD description (5.12 or 5.19) corresponds to constant interconnection 
and input matrices of the form 


(5.27) 

and it is assumed that the system has no damping, i.e. R{x) = 0. 

It is required to stabilize the equilibrium point of the form x* = [x^*, 0]^. According to 
proposition 5.1 [35] matrices Jaix) and Raix) has been assumed as: 


J = 


0 I 
-I 0 


= 


0 

9p{Xq) 


Ja{x) 

Ra(x) 


0 

-JfiXq) J2{x) 



0 0 
0 R2{,x) 


^R^{x)>0 


(5.28) 


with Ji{xq), J 2 (x), R 2 {x) free parameters to be defined later. One thing to be specified here, 
for the above equality conditions to be satisfied the free parameters should be chosen in such 
a way that J 2 (x) = — Jj’(x) and R 2 {x) = i ?2 (x) should be satisfied. Under these conditions. 
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the PDE (5.19) reduces to 


0 I 


' K,(x) ' 


0 Jl{Xq) 

-I 0 


K,(x) 


-J^{Xq) J2 {x)-R2{x) 


+ 


0 

gp{xq)P{x) 


^^{x)+Kp{x) 


(5.29) 


where we have also partitioned the gradient of Haix) (5.24) as 




A 

K,(x) 


1 - 

1 _ 






(5.30) 


From equations (5.12) and (5.26), one can observe that if the interconnection matrix Jd{x) for 
closed loop PCH ssystem is not modified (i.e. Jd{x) = J{x) with the additional interconnec¬ 
tion matrix Ja(rr) assigned to 0), then the kinetic energy of the closed loop system can not be 
shaped, the reason behind this is as follows: 

Assigning additional interconnection matrix Jaijc) = 0, it is clear from first | equations of 
(5.29), that 


K^{x) = 0 


From Proposition 5.1 it is known that 


Kp{x) = 


dHaix) 

dxp 


Hhence = 0 is true if closed loop interconnection is not modified, which tells that 

the additional energy function Ho,{x) is not a function of Xp, it is a function of Xq only, i.e. 
Ha{x) = H{xq), hence there is only potential energy term in the additional enrgy function 
Ha{xq). So only potential energy of the closed loop system can be shaped, hence it is not 
possible to shape the kinetic energy if interconnection matrix is not modified. 

For fully actuated case where gp{xq) = J, one can assign arbitrary potential energy and 
thus can add damping in all input channels. The under-actuation puts severe constraint on the 
method to get arbitrary potential energy. 
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As the additional energy term Ha{x) plays a measure role in shaping both kinetic and po¬ 
tential energy, thus the free parameters of matrices J\{xq), J 2 {x) mdR 2 {x) shown in equation 
(5.28) are chosen in such a way that the PDE of equation (5,29) admits a solution of the form 

Haix) = ^XpN(Xg)xp + H'^ixg) (5.31) 

for some system matrix N(xq). To shape the Xp component of Hii(x) we require M~^{xq) -\- 
N(xq) to be positive definite. On the other hand, thepotential energy term of Haix) i.&.H'^{xq) 
will need to dominate the open loop system potential energy term U{xq) to keep Hd{x) pos¬ 
itive definite. From the first | equations of (5.29) an expression of matrix N(xq) in terms 
of M{xq) and Ji{xq) can be obtained. To solve last | equations of (5.29) it is required to 
eliminate dependence with respect to Xp to find out the solution of PDFs. Here is the method 
to present our main result. 

How to find out the function Haixq) 

Proposition 5.2 [29] To find out Ha{xg) consider the mechanical system of the form (5.12) 
with the system total energy given in equation (5.26), the system matrices given in equation 
(5.27) and the desired equilibrium to be stabilized x* = 0]^. It is assumed that one can 

find out functions P{x), R 2 {x) = H^{x) > Q, J\{x), J 2 {x) = —Jfix) and a vector function 
K'(xq) : K? —> Ra satisfying, 

ll + Jf(x,)]K',(x,) = + + ' ■ 

+ [J2(a:) - R2{x)][I -I- (Xq)]~'‘-M~'^{Xq)Xp 

-gp{xq)fi{x) (5.32) 

{where S{x) = rcjfj -1- {xq)]-^Jx{xq)M-^{xq)xp) 

and such that 

(i) (Integrability) Kg{xg) is the gradient of a scalar function. 
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{ii) (Equilibrium assignment) Kg{xq) at Xq verifies 


— Q^2 (^9*) (5.33) 

(Hi) (Lyapunov stability) The Jacobian ofK'^ixq^), at Xg*, satisfies the bound 

(5-34) 

and[I + J[{xq)]-W-\xq) is a symmetric positive definite matrix. 

Under these conditions the system (5.12) in closed loop with u = fi{x) is a PCH system with 
total energy. 


Hd{x) = + J^{Xq)]-^M-\Xq)x^ + U{Xq) + H',(Xq) (5.35) 


where 

-g^M = K',(x,) (5.36) 

and the interconnection and damping matrices are of the form 


Jd{x) 

Rd{x) 


0 / 4- Jl{Xq) 

-I-jfiXq) J2{X) 


0 0 
0 R2ix) 






respectively. Furthermore, x* will be a locally stable equilibrium. It will be asymptotically 
stable if condition of equation (5.25) satisfies i.e in addition to above the largest invariant set 
under the closed loop dynamics contained in 


X € 


dHd 


iT 


dx 


(X) 




equals {x*}. 
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Proof: 


To carry out the proof it is required to show that, for this particular case of the mechanical 
system, the conditions of general proposition 5.1 reduces to those of proposition 5.2 
From the first | equations of (5.29) it is obtained that 


K,(x) 


Jxip'q) 


dH 

-1 


(x) 4- Kp(x) 


Ji(xg)[M ^(xq)xp + Kp(x)] 


(5.37) 


As Kp{x) = postulating a solution of Ha(x) of the form (5.31) one can get that Kp{x) = 
N(Xg)Xp, so equation (5.37) becomes N{xg) = — Ji(x 5 )[M~^(xg) + N(xg)] or N(xg) = 
—[/ + Ji (xg)]~^JiM~^(xg). Assuming invertibility of [I + J\ (a;,)], one can see, this is equiv¬ 
alent to 


M-\Xg)+N{Xg) = J:[\Xg)N{Xg) 

^M-\Xg)+N{Xg) = [I + Jl{Xg)]-^M-\Xg) (5.38) 

which, under conditions of proposition is symmetric and positive definite. Hence, in its Xp 
coordinates Hd{x) (5.35) has a minimum at the desired point. 

For the last | equations of (5.29), with the postulated solution of (5.31) it can be deter¬ 
mined that 

K,(x) = i A {4N(x,)x,} + K',{x,) (5.39) 

where Kg(xg) is defined by (5.36). 

As 


dHa d 


dXg 

iJL 

2dXa 


-X^N(Xg)Xp + H'^iXg) 


\xp N{xp)xp^ 4 - Kg{xg') 


dH: 


dx, 


- = Ki^,) 


(5.40) 
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And from last | equations of (5.29) it is clear that 


-Kq{x) = jl{Xq) 

Soreplacing Arg(x) by (5.40) and replacing 7V(rrg) by —Ji{xg)[M~^{Xg)+N(xq)] one can get 
final expression of (5.32). The proof is completed noting that under the remaining conditions 
of the proposition 5.2 the function (5.35) has a local isolated minimum at rc*. 


dH 

dXa 


+ Kg{x) 


R2{x)) 


g-+ir,(x) 


+ 9p{^g)P{x) 


(5.41) 


5.5.2 Inertia Wheel Pendulum System 

In this section the Proposition 5.1 is applied to Inertial Wheel Pendulum to stabilize it at 
vertical upward position. It is shown here that IDA-PBC defines a continuous control law 
that “almost” globally stabilizes the upward position, if no actuator saturation takes place. 
The “almost” term makes the controller conservative because the basin of attraction of the 
desired equilibrium point is an open dense set in the state space, which excludes some intial 
conditions with which the state trajectories do not converge to the desired equilibrium point. 

Model 

The dynamic equations of inertia wheel pendulum with the angle (pendulum angle and wheel 
angle) reference along vertical upward axis can be written as (from equation (2.8) of Chapter 
2 ). 


j 0 


Xgl 

+ 

—mgl sin Xqi 

_ 

—T 

(5.42) 

0 jr 


Xq2 


0 


r 



where [xqi^Xq-^ = of which ipp,il}r are pendulum and rotor angle respec¬ 

tively. At vertical upward position [V’p, ipr] = [0,0]. Defining the state a; = [xq,Xp]'^ = 
[Xq, Mxq]^, with M = diag{yV}, the total energy is 

Hix) = ]-x^M~^Xp -h mgl{cos Xqi-1) (5.43) 

A 
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and the PCH model takes the form of (5.12) with gp{Xq) = [—1,1]^, which can be shown as, 


iql 



—mgl sin Xgi 


0 

Xq2 


0 / 


0 

+ 

0 

ipl 


-I 0 


EeI 

3 


-1 

Xp2 



^p2 

Jr 


1 


(5.44) 


The equilibrium to be stabilized is the upward position with the inertia disk aligned, which 
corresponds to x* = 0. 


Controller Design 

The IDA-PBC is designed here in two steps. First a state feedback control is determined 
which shapes the closed loop energy to stabilize the upward position of pendulum, finally the 
damping is added to closed loop system to ensure asymptotic stability. The next task is to find 
out the region in which the controller works for the constrained control input Inertial Wheel 
Pendulum system. 


Energy Shaping 

For the inertia wheel pendulum system, the inertia matrix M is independent of Xq. Hence 
recalling the discussion in the proposition 5.1, one can take J 2 {x) = R 2 {x) = 0. Also it will 
be clear very soon that, taking Ji to be a constant matrix finding the solution of PDFs become 
easier. With these considerations the PDF to be solved (5.29) takes the form 


0 J +Jx 

-{I + JiY 0 


Kq 

Kp 


JT 


-JiM~'^Xp 




0 


—mgl sin Xqi 


+ 


-1 

r 

0 




1 



(5.45) 


Separating out the equations one can get, for first two rows Ji(M~^Xp + Kp) = —Kp. A 
solution of the form Ha — \x^Nxp + H'^ixq) is postulated for the equation (5.45) according 
to equation (5.31), with N = N'^, and H'(xg) to be defined later. In this case one can easily 
get + N)xp = —Nxp which, under the assumption of invertibility of (/ + Ji), is 
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satisfied if and only if + N = (I + Let 


a d 
h 0 

with a, b, d constants to be assigned by defining some conditions. As TV = 
substituting matrix parameters of Jj and M one can easily get 



iV = 


bd^a 




j{l+a-bd) jr{\’^a—hd) 

-b bd 

j^l+a-bd) jr(14-a~M) J 


symmetric condition N = iV^ imposes b = j-d, so matrix N becomes 


3r _ 


-d 


N = 

So sum + N comes out to be 

M-^ + N ■■ 


Jil+a-jpcP) jril+a.--^cP) 


~d 


J£. 


jr(l+a—^(P) j2(l+0-^d2) 


-d 


jil+a--^cP) jr{l+a-^(P) 


-d 




V 

which needs to be positive definite to shape the Xp components of Hd(x). To satisfy this 
condition following inequality must be true 


1 + a - > 0 

3r 


...Condition I 


(5.46) 


Now looking into third and fourth equations of (5.45) which is in form of 


— (J + Ji)^ Kq = Ji 


-mgl sin Xqx 
0 


+ 


—r 

T 


(5.47) 


and replacing Ji, h and adding up both the rows, a simple PDE in following form is obtained 
(1 4- a + d)Kqx + (1 + j^d)Kq 2 = (a + d)mgl sinx^i 
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If (1 + o + a’) / 0 and (I + j^d) 0, then this PDE admits a family of solutions 




(I d 


1 •{■ Q + d 


mgl{cosXqi — 1) + ^»(z) 


(5.48) 


where #(•) is an arbitrary (continuously differentiable) function with argument 


^ i+!Ld^92 

Jr 


So the new potential energy (Let’s say it (/^•(x)) 

Un{x) = mgl{cosXqi - 1) 4- H'' = cosXqi - mgl 4- $(z) 
This new potential energy function will have an extremum at x, = 0 if and only if 




dx, 


(x) 


9l J i5=0 


mgl . d^(z) 

-sinxgi4- 


1 4” u 4" 

a$(z) 


= 0 


a;g=0 


dz 


= 0 


Iq=0 


and 


dx, 


q2 


(x) 


Xq=0 


1 -f* d “h 0? 


a$(z) 

dz 


= 0 


Xg=0 


Hence from the equations (5.49) and (5.50) the following condition is true 


(5.49) 


(5.50) 


a^(o) 

dz 


= 0 


(5.51) 


It will, furthermore, be a minimum if is positive i.e., the matrix 


d^Ui 


M- 


d’^UN 

dXqldXq2 

d^UM d^UN 

dXq2dXqi dx’^q^ 


is positive definite. Hence following conditions must be satisfied 


dl^Ui 


N 


J.,=0 


d'^^z) 


dz^ 


> 


mgl 5^$(zj 

- -— cosxgi 4- .a I 

1 4- a 4- a dz^ 

mgl 


>0 


iCqr—O 


J Xq^O 


1 4" fli 4" d 


(5.52) 
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and 


—(1 + a + rf ) 


dz^ 


> 0 


Clearly, a necessary condition for this is 


1 + a + d < 0 ... Condition II 


(5.53) 


In which case it is sufficient to take 


> 0 (5.54) 

To verify (5.52) and (5.54) a quadratic function $(z) = has been proposed with > 
0. Finally, to verify the conditions of proposition 5.1 the parameters a and d should satisfy 
conditions given in equations (5.46) and (5.53) whose intersection is clearly a non empty set. 
So the final expression of additional energy term Ha is. 



H “ —— 

“ 2^ P 1+a+d 


^ ^ mgl{cos Xgi - 1 ) + ^ (Xgi + 72 X^ 2 ) 


(5.55) 


The energy shaping stage of the design is completed evaluating the control law from (5.47) 
(Taking any one of the two equations), taking the first equation of (5.47), i.e. 

—(1 + a)Kqi — bKq 2 = —amgl sin Xqi — r 

and substituting functions Kqi, Kq 2 , b, $( 2 ) and 2 ; one can easily get the energy shaping con¬ 
trol law as follows: 

r = 7i sin Xqi + Kp (xqi + 72X52) ( 5 . 56 ) 

1 1 __ 1-eft 

where Kp = Kp > 0 and 

A dmgl A l+a+d 

'1 l+o+d> '2 l+j-d 


Damping Injection 

The controller above yields the conservative closed loop system x = Jdr^{x), Ja = —Jj, 
with the origin a stable equilibrium, because {0} = arg min{fr(i(a:)}. The equilibrium point 
can be made asymtotically stable with damping injection, which is achieved by feeding back 
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the passive output y' = g^{x)^^{x), the passive output y' of closed loop system is deter¬ 


mined as follows: 


y' = 


0 0-11 


Mi 

dXql 

Ml 

dXg2 

M 

dxpi 

M 

Oxpa 


Which comes out to be 


1 -t- / j 

y = —7 -+ 'y2~r^p2 

j(l+a-j;d^)\ 3r 


(5.57) 


Taking the complete control law is 


r = 7 i sinsqi -f- Kp{xqi -f 720 : 92 ) - Ky' 


(5.58) 


where is a positive proportionality constant, the constant k is always negative due to the 
conditions 5.46 and 5.53. So the above equation (5.58) can be rewritten as, 


T = 7i sin Xqi + Kp(xqi -b 72 o :, 2 ) + Ky 


^pl + l2~rXp2 

Jr 


(5.59) 


where Kp, Ky > 0 and Ky = —nky. Then control input to the PWM card is u = r/0.00493 
[as discussed in Chapter!]. By substituting r from equation (5.59) in place of ^(a:)in equation 
(5.19) and replacing matrices J, Ja by their elements, we can represent the additional negative 
output feedback term in control input r in terms of elements of damping matrix Ra- 


Assymptotic Stability 

It is required to prove that the vertical upward position of pendulum is asymptotically stable 
which is proved in terms of convergence of closed loop energy function H^. The closed loop 
energy for mechanical systems is shown in equation (5.35). So closed loop energy for the 
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pendulum system is calculated as 


1 

Hd = ~Xp [M ^ + N) Xp + ^(cosa:gi — 1) + -^{xqi + 72 X 52 )^ (5.60) 

As [M~^ + A"] is possitive definite, d < 0 and —1 < (cos Xgi) < 1, thus all terms of Hd are 
> 0, which states that Hd > 0. Time derivative of closed loop energy function Hd is, 

^ ^_dHd^ ^dHd^ ,dHd^ , dHd^ 

H-d o “h Q d" Q ^pl o ^p2 

OXqi OXq2 OXpi dXp2 

Substituting the partial derivative terms and time derivative of states in the above equation and 
simplifying one can get, 

Hd = + 72-Xp2f (5.61) 

J-^p Jr 

which is a negative semidefinite function, this implies that Hd is a non-increasing function. 
Hence Hd finally converges to a constant. As energy Hd is remaining bounded, observing 
equation (5.60) one can easily get that all states of the system i.e. (Xqi, Xg 2 , Xpi and Xp 2 ) 
will remain bounded. Thus LaSalle’s invariance principle can be applied here[24]. In order 
to apply LaSalle’s theorem , it is required to define a compact set fl with the property that 
every solution starts in this set remains in it for all future time. Again it is required to define 
an invariant set F, the set of all the points at which Hd = 0. From equation(5.61) in set F, 
Hd = 0. And from this, it follows that {xpi -t- 72 ^^ 2 ) = 0 in F. Which implies, in the 
invariant set F 

Xp2 = —^Xpi (5.62) 

JI 2 

So in invariant set the control input r takes the form 

T = 7 i sin Xgi -J- Kp{xqi + 722 : 92 ) (5.63) 

As in invariant set Hd = 0, it means Hd is constant in invariant set, by carefully observing 
the closed loop energy function Hd in equation (5.60) it is clear that Hd will remain constant 
when all the terms involved in it remain constant. From this observation it is clear that Xqi is 
a constant, thus Xpi = 0 and hence equation (5.62) shows that Xp 2 is also 0. 
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The rate of system energy can be determined from equation (5.43) to be i? = ^ ^ j 

as both Xpi and Xp 2 are equal to 0 in invariant set F, thus i? = 0 in the set. Whis implies H 
must be a constant. Two cases for the system energy are considered in invariant set, those are 
if = 0 and if = if ^ 0 for some constant K. , 

1. if = 0 : As ii = 0, from equation(5.43) we get As Xpi — 0, hence Xp 2 = 0. So kinetic 
energy of the system \x^M~^Xp = 0. The system energy 

if = ^x^M~^Xp + mgl{cosXqi — 1) = 0 (5.64) 


Making use of equation (5.62) we get 



±1 + mgl{cosXqi — 1) = 0 


Since^ ^1 + ^ j is constant, above equation defines a perticular trajectory called Ho¬ 
moclinic orbit of the pendulum in the (x^i, Xgi) phase plane. Hence pendulum moves 
in clockwise or anti-clockwise to reach equilibrium point. Observing equation (5.64) 
we get the potential energy of the system mgl{cosXqi — 1) = 0. Hence Xqi = kir for 
k even. So equation(5.60) takes the form Hd — which must be a 

constant, to which the closed loop energy finally converges (as Hd is non-increasing). 
This is the potential energy of the closed loop system when mgl{cosXqi — 1) = 0. As 
per the design (in equation(5.51)) of the controller the closed loop potential energy Un 
is minimum atz = (xqi -f- 72 X 52 ) = 0 X 52 = —Hence equation (5.59) shows r 

is also = 0. So the position of pendulum with states [fcTr, — 0, 0] for k even, which 

is the upward position of the pendulum, is asymptotically stable as this position is the 
only largest invariant set in set F 

2 . H = K ^0: Since in invariant set Xpi = 0 , kinetic energy of the system \x^M~^Xp — 
0 , this implies that potential energy mgl{cos Xqi — 1 ) is equal to constant K, which gives 
Xgi is constant. As the kinetic energy of closed loop system is 0 (as Xpi = 0), closed 
loop potential energy Uiq = ^ (cos Xgi — 1 ) -f ^ {xgi - 1 - 72 X 52 )^ which must be a constant 
and minimum, to which closed loop energy Hd converges. As per the controller design 
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Un is minimum at 2 = (xgi + 722^92) = 0. With Xpi = 0, from the system dynamic 
equation (5.45) one can get, 

Xpi = mgl sin Xgi — u 

= mgl sin Xqi — 71 sin Xgi + Kp(xgi + 720:92) as Xpi = 0 
= sinxgi - Kp{xgi+'y2Xg2) (5.65) 

JL "T" a 1 “ a 

As Xqi is constant, Xpi = 0. Substituting 2 = {xgi + 72X52) = 0 in equation (5.65) 
one can get r^^^mgl sinxgi = 0, which gives Xgi = kir for k even =» Xq 2 = 

Hence the pendulum system energy H = K = 0, which gives the same conclusion as 
first case. 

5.5.3 Simulation Results for IDA-PBC on IWP 

Response of the pendulum with IDA-PBC are shown in figures [5.1-5.2]. The system param¬ 
eters remain same as specified in Chapter 1. The controller is tuned to satisfy the Conditions 
I (5.46) and H (5.53) i.e. 

14-0 — ^ 0 and (14-04-d) <C0 

Taking the specified parameters j = 4.572 x 10“^ and jr = 2.495 x 10“^ the above two 
conditions can be combined as 

= 183.2465 < 1 4- o < -d (5.66) 

Jr 

The parameters o, d and c should be choosen to satisfy above condition (5.66) and also to make 
the controller response faster than the system response. Or in physical sense, it can be said 
that around vertical upward position the controller must be faster enough to pull the pendulum 
to exact upward position, before it falls due to any small external disturbances like noise. 
Keeping these two things in mind, a and d are tuned to —0.9979 and —0.003 respectively. 
With these parameters we get 71 = 1.194816, 72 = 0.001999 and k = -218469.846066. 
Choosing AT' = 0.1 > and k^ = 0.0001 we get Kp = 0.0001 and Ky = 21.846985. 
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(a) Pendulum Angle Xai (b) Pendulum Angular Velocity Xqi 




(c) Wheel Angule Xa2 (d) Wheel Angular velocity Xa2 




(e) Torque Input u (f) Closed loop Energy Hd 


Figure 5.1: Simulation results of IDA-PBC (x^i ~ t, Xqi ~ t, Xq 2 ~ t, Xq 2 ^ t, u t, Hd 
t) 

Some points have been discussed in subsection (5.5.3) regarding tuning the parameters for 
applying the controller on practical system. The following observations are made from the 
simulation results shown in Figures 5.1 and 5.2.. 


1 . Pendulum goes from vertical downward position to vertical upward position in one 









Figure 5.2: Simulation results of IDA-PBC {Hp ~ Xgi ~ Xgi 

swing and stabilizes at the top with a damped oscillation. 

2 . Pendulum angular velocity finally converges to zero. 

3. From the plot of wheel angle for a period of 3 seconds, it appears to attain a final value 
of 1600 radian, but actually it converges to zero very slowly and may take a huge amount 
of time (> 5000 seconds). 

4. Wheel angular velocity finally converges to zero, but the peak angular velocity becomes 
as large as 6000 radian/second and it requires a very large control input of about 175 
unit. These value are difficult to achive in real time systems. 

5. Closed loop energy is non increasing, which converges to zero as the pendulum reaches 
the vertical upright position. 

Limitations 

The simulation results for IDA-PBC to bring the pendulum from vertical downward po¬ 
sition to vertical upward position, are shown in figures 5.1-5.2. The figure 5.1(e) shows 
that the maximum value of control input required in this case is ±110, which is beyond 
the specified limit of ±10, hence global stabilization can not be achieved on practical sys¬ 
tem by this controller Le. the downward equilibrium point [tt, tt, 0 ,0] does not lie in the 
domain of attraction of vetrical upward equilibrium point with limit on the control input. 
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Then what is the domain of attraction of upright equilibrium point with limit on control 
input ?. From the simulations and experiments it is determined that with the initial states 
[xqi = ±0.12rod, Xq 2 = TlbOOrad, Xqi = ^l.Qrad/sec, Xq 2 = iSOOrod/sec], vertical up¬ 
ward position is asymptoticaly stable with the limit on the control input. Hence this controller 
can be used as a balancing controller. If the initial conditions are within the above specified 
range then the pendulum finally goes to the upright position. The simulation results for bal¬ 
ancing control are shovm in figures [5.3-5.5] along with experimental results. The figures 
[5.4(e), 5.4(f)] clearly shows that the control input (|u| < 10) is within specified limit, which 
brings the pendulum from Xqi = 0.12 rad to vertical upward position Xqi = 0.0 rad. 

5.5.4 Simulation and Experimental results of Balancing IWP by IDA- 
PBC 

Due to the above said limitations on practical system, here IDA-PBC is used only for balanc¬ 
ing control. For swing up a modified controller is used, which is discussed in next subsection 
with simulation and experimental results. Both simulation and Experimental results for bal¬ 
ancing the pendulum are obtained with the same parameters tuned in subsection (5.5.3). While 
tuning the parameters in addition to satisfying conditions (5.46) and (5.53) following points 
should be kept in mind,(z) With constant a and ky, as d decreases (i.e. increases in negative 
direction) settling time as well as overshoots of pendulum angle Xqi decreases, {ii) With con¬ 
stant d and as a decreases, time interval between consecutive zero crossing of pendulum 
angle Xqi decreases, which is required to damp out the small oscillation around top position 
dixring stabilisation. (Hi) with constant a and d if k^ increases, damping injection increases, 
which ultimately decreases settling time. From the practical observation it is observed that 
during stabilisation, time interval between two consecutive zero crossing of pendulum angle 
Xqi should be < 0.35 sec for better performance of stabilisation. The simulation and experi¬ 
mental results are compared in figures 5.3,5.4 and 5.5. Here are some close observations of 
the results. 

1. The pendulum angle exactly converges to zero in simulation, but there is a small error 
in experimental result due to offset in angle measurement by optical encoder. 
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Simulation Results Experimental Results 



(a) Pendulum Angle Xd (b) Pendulum Angle Xqi 



(c) Pendulum Angular Velocity Xqi (d) Pendulum Angular Velocity Xqi 


Figure 5.3: Simulation and Experimental Results of Balancing IWP by IDA-PBC (Xgi ~ 

t, Xqi ~ t) 
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Simulation Results 


Experimental Results 




(a) Wheel Angle Xq2 


(b) Wheel Angle Xqi 



(c) Wheel Angular velocity Xa2 W) Wheel Angular velocity Xa2 



(e) Control Input u 


(f) Control Input u 


Figure 5.4: Simulation and Experimental Results of balancing IWP by IDA-PBC {Xq 2 ~ 

~ t, U ~ t) 
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Simulation Results 


Experimental Results 




(c) Pendulum Energy Hr, (d) Pendulum Energy Hp 




(e) Phase Plot Xqi 


(f) Phase Plot a;gi ~ Xgl 


Figure 5.5: Responses of balancing IWP by IDA-PBC (Hd ~ t, Hp ~ t, Xgi ~ Xqi) 
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2. The pendulum angular velocity converges to zero in simulation, but in experiment, it 
fluctuates around zero due to amplification of measurement noise in pendulum angle. 

3. The wheel angle converges to zero in simulation after a large time of 5000 seconds, 
but in experiment,it is not possible to collect data for such a long time due to hardware 
limitations. We could collect data for only 4 seconds. But still, one can observe that the 
wheel angle is slowly increasing. 

4. The Wheel speed should converge to zero as per the simulation results, but in real time 
experiment it settles to some steady state value. 

5. The control input is very smooth in simulation but experimental results show a lot of 
noise. 

6 . Closed loop energy plot of both simulation and experimental results are quite similar. 
The energy decreases and attains a stedy state value as soon as pendulum reaches the 
vertical upright position. 

5.5.5 Swing up Controller 

to bring the pendulum from vertical downward position to vertical upward position and stabi¬ 
lize it at upright position.From last subsection it is clear that due to limit on control input, with 
IDA-PB controller a small region of pendulum angle [xqi < ±0.12) lie in the domain of at¬ 
traction of upward equilibrium point. So the controller cannot bring the pendulum from, down¬ 
ward position to upward position. To do the job it is required to design a swingup controller, 
which can bring the pendulum from vertical downward position to the domain of attaction 
of the vertical upward position. Once the pendulum reaches into the domain of attraction of 
vertical upward position, switching can be done from swingup to balancing controller, which 
stabilizes the pendulum asymptotically about upward position. As per the design of IDA-PBC 
a damping is injected to the closed loop energy, by which the closed loop energy Hd converges 
to zero, at stable equilibrium point. With no limit on control input, as the pendulum moves 
from downward position towards upward position, the closed loop energy Hd decreases with 
the system energy H increases from —2mgl at downward position to 0 at top position. In 
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this subsection to design swing up controller we concentrate on the pendulum energy H in 
stead of closed loop energy Hd. The contol law derived in equation (5.59) is rewritten here 
for convenience, 


u = 7i sinxgi + Kp(xgi + 72 ^ 92 ) + + 72^a:p2 

Energy Shaping i—— - 

Damping Injection 

The first term shapes both kinetic and potential energy of closed loop system and second term 
injects damping. Second term is a negative feedback of closed loop system output which 
damps out the closed loop system energy, such that the closed loop energy converges from a 
positive value to 0. Now lets see the effect of positive output feedback. With positive feedback 
the control input becomes, 

u = 7i sina;,! + Kp{xqi + 72 x^ 2 ) - (^Xpi + 'y2j;Xp2'j 
with this time derivative of closed loop energy Hd comes out to be, 

Hd = (^^Pi +72^2;p2^ (5.67) 

Equations (5.60) and (5.67) imply that Hd ia non-decreasing function. This is the case for 
unconstrained input case. With limit on the control input the above equation becomes too 
complex, from which we can not infer about the sign defeniteness of the function Hd- As 
we intend to design swing up control, which requires shaping of pendulum energy Hp = 
2 “^—^ mp/(cosXgi — 1) System energy H excluding rotor energy, we will consider only 
pendulum energy Hp. So we do not require the closed loop energy shaping term. Neglecting 
this term from control input, we get 



u = —Kv 


^pl + 72—Xp2 

Jr 


which is the positive output feedback term. With this rate of pendulum energy. 


(5.68) 


Hp = Ky^{Xpi + '^2^Xp2) 

3 Jr 


(5.69) 
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With limit on input the above equation becomes 


Hp = —^ X sat 
J 


Kvi^pi "J” T2 • ^^2) 
3r 


where sat[u] is defined as. 


sat[u] = < 


U 

-U 

u 


ifu>U 
ifu < -U 
otherwise 


(5.70) 


(5.71) 


where (7 is a positive quantity, +U and —U are the maximum and minimum limit on control 
input, respctively. Due to presence of term 'Y 2 ^Xp 2 In equation (5.70), we can not say anything 
about the sign definiteness of function Hp. So we modify the control law as follows 


U - Hy (jCpI ) 


(5.72) 


one can see that this is no longer a positive output feedback control law, rather a positive 
partial state feedback control law, in which proportional of only one state i.e. Xpi is fedback. 
With this modified control law rate of pendulum energy takes the form, 

Hp = Ky-^ with no limit on control input 

Hp = —^^^xsat[—Ky{xpi)] with limit on control input (5.73) 

So equation (5.72) implies that Hp is a non-decreasing function. Hence the positive partial 
state feedback control injects energy into the system. But it does not tell us that the state Xp\ 
will remain bounded. To keep Xpi bounded, the pendulum energy must be kept bounded. So 
we put a bound Hp < 0, where JTp = 0 at top position of the pendulum-. This bound on Hp is 
put with the control law 


-Ky{xpi) if JTp < 0 

0 otherwise 


(5.74) 
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With this control law Pendulum energy Hp reaches to 0, it then follows that 


1 

Hp — + mgl(cosXqi — 1) = 0 

^ 7 


Since j is a constant, the above equation delines the homoclinic orbit of the pendulum in 
(xgi, Xqi) phase plane. Hence the pendulum moves clockwise or anti-clockwise until it reaches 
to top position. 


5.5.6 Simulation and Experimental results for Complete stabilisation 

Simulation and experimental results for complete stabilisation with combined control are 
shown in figures 5.6-5.8.The control parameters, tuned in subsection (5.5.3) are used for the 
contoller design. Figures 5.6(a),5.6(b) shows the plot of pendulum angle Xgi with time.With 
swing up algorithm the pendulum oscillates with increasing amplitude until Xgi reaches within 
the range ±0.12 rad. Due to bound on energy and limit on input, state variables Xgi and Xg 2 
reaches into the domain of attraction of. The energy Hp of the pendulum increases from 
—0.7133 W — m to 0 W — m as it approaches upward position from’downward position. The 
closed loop energy H^i, shown in figure [5.8(a),5.8(b)] varies in a different manner. The en¬ 
velop of the closed loop energy increases, the lower boundry of this is established by closed 
loop energy at the instants pendulum passes through vertical downward position and upper 
boundary by the closed loop energy at the instants pendulum goes to peak in clockwise or 
anti-clockwise direction in each swing. It reaches to 0 when pendulum reaches top position. 
One thing is clear from the closed loop energy plot, if we consider one swing from downward 
position upto a peak in any direction, the closed loop energy is increasing, which is nothing 
but the effect of positive state feed back control input. As soon as the pendulum reaches within 
the domain of attraction, we switch over to balancing control which stabilises the pendulum 
about top position. 
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Simulation Results 


Experimental Results 
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(a) Pendulum Angle 



(b) Pendulum Angle Xni 



(c) Pendulum Angular Velocity Xai 



(d) Pendulum Angular Velocity Xqi 




(e) Wheel Angle Xq2 


(f) Wheel Angle Xq2 


Figure 5.6: Responses of Complete Stabilization (x^i ~ Xgi ^ Xq2 ~ t) 

5.5.7 Concluding Remarks 

The IDA-PBC is succesfully implemented on inertia wheel pendulum for stabilising it about 
vertical upward position. The real complexity of the IDA-PBC design methodology lies in 
solvability of PD.Es. The additional matrix Ra{x) for IWP is chosen to make the solution of 
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Simulation Results 


Experimental Results 



(c) Phase Plot Xqi ~ i,! (d) Phase Plot ~ i,! 


Figure 5.8: Responses of Complete Stabilization {Ha ~ t, Xgi ~ Xgi) 
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PDE simple. The matrix Ra{x) came directly from nagative output feedback damping term 
of control input u. With IDA-PBC method a balancing controller was designed to stabilise 
the pendulum at top position. The Hamiltonian structure is preserved in closed loop by this 
control. For the limited input practical system case the stable equilibrium point is not globally 
stabilizable with IDA-PBC method, so a novel swing up controller is designed with some 
modification in the balancing control, which is a positive state feedback control instead of 
output feedback. In this control, some property of Hamiltonian stmcture is preserved which 
was observed in closed loop energy function for a portion of pendulum swing. One advantage 
of this combined algorithm is that, it is much faster than the other two control techniques 
implemented in previous chapters. 

* * * 
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Chapter 6 
Conclusion 


In this chapter we make concluding remarks and state the scope for fumre work. 

6.1 Summary 

In this thesis, various kinds of dual mode controllers for inertial wheel pendulum (IWP) have 
been studied, analyzed and established through real time experimentation. The emphasis has 
been on energy based controllers. Some of the contributions of this work are as follows: 

• A dual mode control scheme based on feedback linearization is revisited. Here, the 
swing up is achieved by partial feedback linearisation control and stabilisation is achieved 
by input-state feedback linearization. 

• A modification has been done to the existing passivity based swing up control which 
enables the pendulum to reach the domain of attraction of balancing controller faster. 
Real time experimental results show a marked improvement in the performance of the 
modified algorithm over the partial feedback linearisation based swing up control. 

• A novel energy based swing up controller in conjunction wjth IDA-PBC stabilizing 
controller has been designed for the IWP. During experimentation with Mechatronics 
kit, this controller was found to be the fastest among all three dual mode Qontrollers. 
The stability and convergence of this algorithm has been established through extensive 
derivations. The experimental results corroborate our theoretical findings. 
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6.2 Scope of Future Work 


• All the controllers, that have been implemented here, are dual mode controllers. So 
one may think of implementing a single algorithm that can achieve both swing up and 
balancing. The IDA-PBC is a single controller that achieves the above objective only in 
simulation. It does not work on real time systems as it requires very high control input 
and disk speed. One can investigate deeper into this algorithm to make it work on the 
real time systems. 

• Motor dynamics has not been taken into account in any of the above mentioned algo¬ 
rithms. By considering motor dynamics, robust controllers can be designed which are 
insensitive to parameter variations and external disturbances. 

• In our work, the system parameters were available to us, which may not be so in many 
cases. Adaptive controllers may be designed to estimate these parameters so that system 
independent controllers can be designed. 

* * * 
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